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Precision physics at future colliders requires highly granular calorimeters to support the 
Particle Flow Approach for event reconstruction. This article presents a review of about 
10 - 15 years of R&D, mainly conducted within the CALICE collaboration, for this 
novel type of detector. The performance of large scale prototypes in beam tests validate 
the technical concept of particle flow calorimeters. The comparison of test beam data 
with simulation, of e.g. hadronic showers, supports full detector studies and gives deeper 
insight into the structure of hadronic cascades than was possible previously. 
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I. INTRODUCTION 

A. Role and limitations of calorimeters in high energy 
physics 

Progress in elementary particle physics has been driven 
by advances in detector technology as much as by the in¬ 
creased reach of accelerators. The transition from photo¬ 
graphic event collection to electronic data recording has 
given access to rare processes in high rate experiments, 
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and silicon strip detectors made the picosecond lifetimes 
and oscillations of heavy flavoured particles visible. In 
the past 10 years, calorimeters with imaging capabilities 
have been developed, which opens a new era of precision 
in the measurement of particle jet energies. 

Today’s particle physics experiments, mostly at col¬ 
liders, use multi-purpose detectors aiming at capturing 
as precise and complete information as possible for the 
reconstruction of all particles in the final state of each 
collision - their type and kinematic properties. Whilst 
trackers infer the momenta of charged particles from the 
curvature of ionisation trails in magnetic fields, calorime¬ 
ters obtain the energies of particles only indirectly via 
the debris emerging from their interaction with a block 
of matter, so-called showers. Without calorimeters, the 
reconstruction of jets, which contain many types of par¬ 
ticles, is not complete. 

Future particle physics projects, with hadron, lepton or 
neutrino beams, place ever-increasing demands on the de¬ 
tailed reconstruction of the beam interaction final states. 
For example, the next generation of e+e“ linear collid¬ 


ers, the ILC (Baer et al 2013) with centre-of-mass (c.m.) 
energies of 250 GeV - 1 TeV and its detectors ILD and 
SiD ( Behnke et al] 2013), and CLIC ( Linssen et al] 2012 | ) 
(250 GeV - 3 TeV) with adapted detectors, will enter a 
domain of precision measurements with heavy bosons — 
W, Z and H — being copiously produced. They must be 
reconstructed in multi-jet final states and identified on 
the basis of their invariant mass. The W - Z mass sep¬ 
aration is about 10 GeV. Since the dijet invariant mass 
is given by = 2 EiE 2 {l — cos 6 >i 2 ), where Ei ^2 are 
the energies of the jets and O 12 the angle between them, 
a jet energy resolution of (TeIE translates into a mass 
resolution = {'^1 V^)o'eI if angular uncertain¬ 

ties can be neglected. Taking the natural width of about 
2.7% into account, a 3 cr separation then requires a jet 
energy resolution of 3-4% over a wide range of jet ener¬ 
gies, from 50 to 500 or 1000 GeV, for the ILC or CLIC, 


respectively (Thomson 2009). 


Physics channels particularly sensitive to the jet energy 
performance are those with heavy bosons to be identified 
in dijet final states. The process e+e“ ^WWz^e^e, which 
probes the WW scattering amplitude and which is to be 
separated from its irreducible TILv^v^ background, is an 
often quoted example. Other examples are the measure¬ 
ment of the H ^ WW* branching ratio, which together 
with the cross section for Higgs production in the WW 
fusion channel enters into the determination of the Higgs 
total width. Here both W and Higgs masses are recon¬ 
structed. For ZH final states, where the Z decays into 
neutrinos only the Higgs decaying into a pair of jets is 
visible in clean conditions, which give access to the Higgs 
coupling to charm quarks. Channels with large jet mul¬ 
tiplicity, like ttH, are less affected, since there jet finding 
ambiguities dominate the mass resolution. On the other 
hand, the ZH^ 4 jets final state provides a Higgs mass 


resolution comparable to the recoil mass technique, using 
leptonic Z decay modes, and is sensitive to the calorime¬ 
ter performance, even though kinematic constraints can 
be applied. 

The classical way of measuring jet energies is to sum 
up the energy depositions of all charged and neutral par¬ 
ticles of the jet in the calorimeter system, generally com¬ 
posed of an electromagnetic section (ECAL) followed by 
a hadronic part (HCAL). In this case, the resolution de¬ 
pendence on the energy E approximately follows a form 
^ = s ^ 05 where a, called the stochastic term, re- 

^ y^£;(GeV) ’ ’ 

fleets statistical fluctuations in the shower evolution and 
measurement, and 5, called the constant term, arises from 
imperfections in detector homogeneity, stability and cal¬ 
ibration. At higher energies, there are additional contri¬ 
butions from the fluctuations of non-contained energy, or 
leakage. Typically, the term a attains values of 50 — 100%, 
while 6 is a few percent, which altogether is not sufficient 
to meet the linear collider goals. 

The limitation of the classical approach stems from 
the fact that most of the jet energy 70%) is carried 


by hadrons (Green et al. 1998 Knowles and Lafferty 


1997) and the jet measurement thus inherits the intrinsi¬ 


cally poor performance of traditional hadronic calorime¬ 
try, if no tracking information is used. In contrast to 
the cascades of bremsstrahlung and pair creation initi¬ 
ated by electromagnetic particles, hadronic showers are 
characterised by a much smaller number of subsequent 
nuclear collisions and smaller number of produced par¬ 
ticles, and thus suffer from much larger fluctuations. In 
addition, a large number of different fundamental pro¬ 
cesses, in general give rise to a different detector response 
for the same deposited energy (Fabjan et al. [T977t . Tar- 
get nucleus recoil and nuclear excitations do not or only 
partially contribute to the signal (’’invisible energy”) at 
all. Particles like tt^ and 77 decay into photons which 
initiate electromagnetic showers and do not participate 
further in the hadronic cascade. Most calorimeters are 
non-compensating, which means that the response (e) 
to this electromagnetic fraction differs from that to the 
hadronic part, h, i.e. e/h % 1 (Wigmans 1987). The 
response to hadrons thus fluctuates with the fractions of 
electromagnetic and invisible energy. Since the electro¬ 


magnetic fraction increases with energy (Gabriel et al. 


1994), the hadron response also becomes non-linear. The 


jet response then in addition fluctuates with how the en¬ 
ergy is shared between particles, and with the electro¬ 
magnetic content of the jet fragmentation itself. 

As a result of these factors, one direction of research 
on hadronic calorimeters has concentrated on compensa¬ 
tion methods to restore ejh = 1 , for instance, by iden¬ 
tifying electromagnetic showers inside hadronic show¬ 
ers (Andrieu et al. 1993[ ). The best single hadron res¬ 
olution at a collider detector is achieved by the ZEUS 


calorimeter (Bernstein et al. 1993), with a = 35%. The 
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performance for jets, however, generally does not reach 
this value, since it is subject to additional degrading ef¬ 
fects. ZEUS quotes a hadronic Z mass core resolution of 
6 GeV (Abramowicz et al. 2013), which is considerably 


worse than one might naively expect on the basis of the 
single hadron resolution, and not sufficient for the separa¬ 
tion of W and Z final states. More recently, dual readout 


techniques (Akchurin et al. 2005) have been explored. 


which can in principle reduce the effect of fiuctuations 
in the electromagnetic fraction by measuring it indepen¬ 
dently. 


B. Particle flow approach 


The particle flow (Pflow) approach (Brient and Videau 
2001 Morgnno^ |2001 ) starts from the observation that 
most particles in a jet - charged particles and photons - 
can in principle be measured with much better precision 
than generally provided by the calorimeter for hadrons. 
In the range considered here, charged particle energies 
are best measured with tracking systems, which offer rel¬ 
ative resolutions of about 10 “^£'(GeV), and individual 
photon energies can be measured with relative precision 
of about 15%//B(GW) or better in electromagnetic 
calorimeters. The PFlow method aims at optimising the 
jet energy resolution by reconstructing each particle in¬ 
dividually and use the best available measurement for 
each. In a typical jet 60% of the energy is carried by 
charged particles, 30% by photons and only 10% by long- 
lived neutral hadrons {K^ and n), for which hadronic 
calorimetry is unavoidable. Assuming the above reso¬ 
lutions for tracks and photons, and 55%/E(GeV) for 
hadrons, then, in the ideal case, where each particle is 
resolved, a jet energy resolution of 19%/y^E/GeV) could 
be obtained. Here the dominant part {17%/^/E{GeV)) 
is still due to the neutral hadrons. The jet composi¬ 
tion fluctuates from event to event; so for jets with a 
smaller neutral hadron fraction the precision is higher, 
and viee versa. Particle flow-like techniques were first 
applied in the ALEPH detector ( Busknlic et a/. [P'995 ), 
which achieved a jet energy resolution of 60%/\/E, or 
6.2 GeV for hadronic Z decays. 

More recently particle flow techniques are successfully 


used in the GMS experiment (GMS 2009), which is well 


suited for this purpose. GMS has a large silicon tracker 
in a uniform solenoidal field of 3.8T, and a finely seg¬ 
mented electromagnetic calorimeter made of 75,000 lead 
tungstate crystals surrounds the tracker. The hadronic 
calorimeter segmentation is 25 times coarser, such that, 
in jets above 100 GeV/c, neutral hadrons cannot be sep¬ 
arated from charged ones, but are rather detected as an 
excess of the calorimeter energy over the tracker momen¬ 
tum. Using a careful and accurate reconstruction of the 
particle content of the event, the detector performance 
is significantly improved with respect to that using the 


calorimeter alone. This is illustrated for the simulated 
jet energy resolution as a function of transverse momen¬ 
tum in Fig. Similar improvements are achieved for 


I CMS Preliminary 



FIG. 1 Simulated jet energy resolution as a function of trans¬ 
verse momentum pr f or calorimeter jets and using particle 
flow techniques. From (Beaudette 2010 CMS 2009). 


the reconstruction of missing transverse energy and for 
hadronic r decays. 

The CMS collaboration has verified the particle flow 


performance using proton-proton collision data (Cha- 
trchyan et al. 2Qlla|b ). The transverse momentum bal¬ 
ance of dijet and 7 + jet events has been utilised to extract 
the resolution from the width of the observed jet asym¬ 
metry distributions. Soft radiation is accounted for by 
performing the measurement as a function of activity in 
the events in addition to the two jets and extrapolating 
to zero. The results are shown in Fig.j^and compared to 
the expectation based on the relative width of the sim¬ 
ulated jet energy response, corrected for an additional 
constant (c) term. This term was extracted from apply¬ 
ing the same procedure to data to the simulated events 
and is attributed to calibration imperfections. 

The CMS data confirm the expected gain in perfor¬ 
mance using the particle flow techniques. The net per¬ 
formance, however, is comparable to that of the ATLAS 


detector using calorimetric methods (Aad et al. 2013) 


and does not yet match the goals formulated for the lin¬ 
ear collider experiments. The particle flow method relies 
on the ability to properly assign the calorimetric energy 
depositions to individual particles, which places high de¬ 
mands on the imaging capabilities of the calorimeters, 
and on the pattern recognition performance of the recon¬ 
struction software. The steps towards a detector fully op¬ 
timised for particle flow consist in extending the detailed 
topological reconstruction and high granularity into the 
hadron calorimeter section, in further optimising the seg¬ 
mentation of the EGAL and the efficiency of the tracking 
system, and in developing more sophisticated reconstruc¬ 
tion algorithms. For illustration, a jet simulated in the 
highly granular ILD detector and the colour-coded re- 
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FIG. 2 Relative jet energy resolution, extracted from the mo¬ 
mentum asymmetry of dijet events measured with the CMS 
detector in proton-proton collisions. The results are compared 
to the width of the response for si mulated jets, corrected fo r 
an additional constant term. From (Chatrchyan et al. 2011a). 


suit of the reconstruction algorithm is shown in Fig. 
Only the deposits left over, after removing those associ- 



x/mm 


FIG. 3 Simulated jet in the ILD detector, with particle flow 
objects reconstructed by the Pandora algorithm shown in dif¬ 
ferent colours. From (Thomson 2009). 


ated with charged particles or identified as photons, will 
be interpreted as neutral hadrons. In practice, this can¬ 
not always be done unambiguously, if the particles im¬ 
pinge too close to each other. Mis-assignments give rise 
to additional measurement uncertainty, which is called 


confusion. For example, a neutral particle shower over¬ 
lapping with that of a charged one could remain unre¬ 
solved and mis-interpreted as part of the charged hadron 
shower, which is replaced by the track measurement; so 
the neutral energy would be lost. On the other hand, 
a detached fragment of a charged particle shower could 
be mis-identified as a separate neutral hadron, and the 
fragment energy could be double-counted. Therefore, it 
is not a priori guaranteed that the particle flow recon¬ 
struction yields a better resolution than the calorimetric 
measurement alone. 

The Pandora particle flow algorithm (PFA) (|^ om- 


son 


2009) is the most developed and best perform¬ 
ing today. Recently developed alternatives like AR¬ 


BOR (Ruan 2013) are still less performant than Pandora 


but have the merit of delivering an independent valida¬ 
tion of the Particle Flow Concept. For the assignment of 
the energy depositions to particles. Pandora makes use 
of topological information, including the sub-structure 
of showers, as well as the compatibility of calorimetric 
and track-based measurements. In this way, it is ensured 
that at higher energies, as jets get more collimated and 
particles become harder to separate, a smooth transition 
is made to a classic energy flow like reconstruction, in 
which neutral particles are rather identified as excess in 
energy above the track-based expectation, and the clas¬ 
sical, purely calorimetric performance for the jet is either 
retained or improved where the track assignment is un¬ 
ambiguous. 

The development of the algorithm has proceeded along 
with a detailed understanding of the relative roles which 
resolution and confusion effects play in different energy 
regimes, and which properties of the detectors drive the 
performance. For the use of energy momentum match 
in the assignment of energy depositions, and for energy 
flow treatment of dense jets, particle flow calorimeters 
with their emphasis on imaging must still feature a good 
energy resolution. Furthermore, the neutral hadron en¬ 
ergy uncertainty is the dominant contribution to the jet 
resolution for low energy jets, where particles are well 
separated, while at higher energies the confusion effects 
take over. For the detectors envisioned for the ILC, the 
transition is around 100 GeV, as can be seen in Fig. 
Empirically, this can be parametrized as 


rmsgo 

E 


21 

71 


©O.70O.OO4E'© 


100 J 


0.3 


% (E in GeV) 

( 1 ) 

where rmsgo is the r.m.s. of the smallest range contain¬ 
ing 90% of the eventsQ The individual terms represent 
contributions from the intrinsic calorimetric resolution. 


^ The particle flow resol ution function is not Gaussian. Its statis¬ 
tical power was shown i frii omson[ |20Q9| ) to be equivalent to that 
of a Gaussian with a standard deviation of 1.1 xrmsgo- 
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tracking imperfections, leakage and confusion. The per¬ 
formance is compared with the resolution obtained with a 
traditional approach from calorimetric information alone. 
Even at jet energies as high as 500 GeV, where confu¬ 
sion, also shown separately, becomes dominant, particle 
flow brings in a significant improvement. Note that the 
degradation at high energies is also due to leakage, which 
affects the purely calorimetric measurement much more 
severely. A particular strength of high granularity is the 
possibility to use topological information such as the re¬ 
constructed starting point of the shower for the estima¬ 
tion of leakage. This has not yet been exploited here and 
has the potential to mitigate the effects further. 


In the framework of studies for CLIC (Linssen et al. 


2012), it was shown that the required jet energy resolu¬ 


tion can be achieved with the PFlow technique for jet en¬ 
ergies up to 1500 GeV. The studies (Marshall et al 2013) 
also demonstrate that missing transverse momentum can 
be measured with a similar precision as jet energy, and 
that fake missing momentum (in one coordinate) is lim¬ 
ited to 1 - 2% of the event energy. 

Both detector concepts developed for the ILG, ILD 
and SiD, (Behnke et al, 2013), and in modified ver¬ 
sions for GLIC (Linssen et al 2012), have their design 
based on the particle flow paradigm. PFlow demands a 
highly efficient tracking system. In order to separate the 
particles, it calls for extremely compact electromagnetic 
calorimeters - to keep the Moliere radius small - and for 
unprecedented fine calorimeter granularities. To isolate 
photons and resolve the sub-structure of hadron showers. 



FIG. 4 Empirical form of the simulated ILD jet energy reso¬ 
lution as a function of energy. Also shown is the contribution 
from confusion, and, for comparison, the resolution obtained 
from the ILD calorimeter alone, and that of an ideal calorime¬ 
ter with given parameters. From (Thomson 2009). 


transverse and longitudinal cell sizes in ECAL and HCAL 
must be of the order of a radiation length Xq, resulting 
in channel counts of 10^ — 10^. Both EGAL and HCAL 
must fit inside the magnetic coil, in order not to looe con¬ 
tinuity in tracking the shower evolution. The radial and 
longitudinal distance of the calorimeter from the interac¬ 
tion point and the magnetic field should be large to allow 
separation of shower components. For the same relative 
change, the dependence on radius is stronger than on the 
field, but then cost considerations need to be folded in, 
as well. The main difference between ILD and SiD is that 
ILD has chosen to favour a larger radius tracking system 
with a time projection chamber (TPC) and a smaller 
field, while SiD follows a more compact design with an 
all-silicon tracker and a higher field. At CLIC energies, 
leakage becomes more important. Since the radius of the 
coil is limited by technical and cost considerations, tung¬ 
sten is chosen as the HCAL barrel absorber material to 
ensure sufficient shower containment. The calorimeter 
technologies, however, are very similar for all cases and 
have motivated a common R&D and validation effort. 


C. Validation approach 


The detector requirements imposed by the particle flow 
principle - high field, large size, dense materials, fine 
segmentation - drive the cost of the resulting detector 
systems far beyond that of previous e+e“ collider exper¬ 
iments. A careful optimisation is thus mandatory, and 
the quest for an experimental validation of the perfor¬ 
mance potential held by the particle flow approach is 
highly motivated. 

It has been suggested to directly test the jet energy per¬ 
formance in test beams by creating bundles of particles 
from a primary beam hadron impinging on a thin target. 
Leaving aside the differences in particle momenta and 
multiplicity, or energy density, between these ’’jets” and 
those generated in quark fragmentation at the same pri¬ 
mary energy, such an experiment would have prohibitive 


cost, as simulation studies have shown (Morgunov 2009). 


For particle flow reconstruction magnetic momentum 
spectroscopy and large acceptance are indispensable. 

Gonsequently the experimental strategy must be to 
validate the critical ingredients of particle flow calorime¬ 
try individually. First of all, the need for high granu¬ 
larity has spurred the development of novel calorimeter 
read-out technologies, such as large area silicon diode ar¬ 
rays, silicon photo-multipliers (SiPMs) for optical read¬ 
out, 2D-segmented resistive plate chambers and micro¬ 
pattern gas amplification structures. Large scale proto¬ 
types have been built and high statistics test beam data 
have been collected over several years in a worldwide 


effort organised by the GALIGE collaboration (Adloff 


et al. 2012a). The goals were to test the new technolo¬ 


gies and demonstrate their performance, to validate the 
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simulation models for hadronic shower evolution in the 
detail necessary for particle flow reconstruction and, fi¬ 
nally, to test the particle flow algorithms on real data. 
This must be done for a number of different absorber ma¬ 
terials and read-out media, since, for example, the role 
of neutron production is different for light and heavy ab¬ 
sorbers, and gas and scintillator have different sensitivity 
to the various sub-components of the showers. The sensi¬ 
tivity to, e.g. neutrons depends on the hydrogen content 
of the active medium, and on the timing of the read-out 
electronics. 

In detail, the issues to be addressed are: 

• Technical performance: The novel technologies 
must be tested with prototypes large enough to 
contain electromagnetic or hadronic showers, re¬ 
spectively, and demonstrate the expected perfor¬ 
mance in terms of noise, linearity, resolution, uni¬ 
formity and stability. 

• Detector understanding: The detector perfor¬ 
mance must be modelled in simulations in suffi¬ 
cient detail, and the models must be validated with 
muons and electrons, for which the interactions 
with the detector materials can be reliably simu¬ 
lated. 


• Software compensation: The potential offered 
by fine read-out segmentation to apply software 


compensation methods (Abramowicz et al. 1981 


Andrieu et al. 1993 Cojocaru et al. 2004) for 


restoring linearity and improving the hadronic en¬ 
ergy resolution should be realised. 


• Calibration: It must be demonstrated that the 
unprecedented number of channels individually 
read out can be monitored and calibrated to the 
required precision. 


• Hadron shower simulation models: Particle 
flow performance is driven by hadronic energy res¬ 
olution and confusion. The hadronic shower evo¬ 
lution, detector response and resolution as well as 
lateral and longitudinal extension must be correctly 
modelled and confronted with high precision test 
beam data. 


This paper deals with the issues above and thus lays 
the experimental foundation for the ingredients that go 
into full detector simulation studies with realistic and 
complex collider event topologies. These are documented 


in the context of the Pandora development (Thomson 


2009), or in the reports on detector concepts (Behnke 


et al. 2013 Linssen et al. 2012). We do not recapitu¬ 


late these studies here, but we would like to point out 
that a complete exploration of the particle flow approach 
consists of both, the full simulation studies and the ex¬ 
perimental tests reported here. 

The paper will first introduce the candidate technolo¬ 
gies for highly granular calorimeters, sketch their imple¬ 
mentation in the linear collider detector concepts ILD 
and SiD, and describe the large prototypes built and 
tested by the CALICE collaboration. Their performance 
in test beams will be presented and confronted with ex¬ 
pectations, and the required simulation details will be 
discussed. It should be noted that not all prototypes are 
at the same stage of implementation or testing. To rep¬ 
resent the current situation we present the full range of 
prototypes at their various stages of development. Im¬ 
provements achieved with different approaches to soft¬ 
ware compensation are described in a separate section, 
together with the results obtained with a combination of 
ECAL and HCAL prototypes. A calibration section deals 
with the extrapolation to a full detector system, and the 
challenges and benefits of high granularity in this respect 
will be discussed. The section on shower model valida¬ 
tion starts with an overview of current state-of-the-art 
simulation programs and presents comparisons of their 
predictions with data for a number of global observables 
as well as for shower sub-structure. A central result of the 
validation effort is the application of the Pandora particle 
flow algorithm (Thomson 2009) to CALICE data and the 
test of the two particle separation power with test beam 
events. The article closes with a near-term outlook and 
possible future directions. 


II. PARTICLE FLOW DETECTORS AND CALORIMETER 
TECHNOLOGIES 

A. The ILD and SiD calorimeter systems 


• Shower sub-structure: The reconstruction of 
individual particle showers makes detailed use of 
the shower topology, and sub-structures like tracks 
linking different fragments of the same shower, 
which should be seen in beam data and quantita¬ 
tively tested. 

• Particle flow algorithms: Particle flow algo¬ 
rithms should be applied to real test beam data, 
and their capability to resolve the topologies and 
to separate particle showers from each other should 
be measured and compared with predictions. 


The ILD and SiD calorimeter systems are central fea¬ 


tures of these detector concepts (Behnke et al. 2013 


Linssen et al. 2012). The calorimeters are part of in¬ 


tegrated detector designs to take advantage of the PEA 
approach to achieving the excellent jet energy resolutions 
required by experiments at future lepton colliders. Quad¬ 
rant views of the ILD and SiD detectors are shown in 
Fig-i and Eig. [^respectively. 

Somewhat modified designs have also been developed 
for experiments at the higher energies of CLIC, the main 
differences being greater calorimeter depths, and a more 
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FIG. 5 
2013). 


The ILD Detector for the ILC. From (Behnke et al.\ 



FIG. 6 
2013). 


The SiD Detector for the ILG. From (Behnke et al.\ 


demanding timing requirement. 

The most significant difference between the ILD and 
SiD designs lies in the tracking systems: ILD uses a time 
projection chamber (TPC) supplemented with layers of 
silicon, while the SiD tracker is an all-silicon system. As 
a result, the inner radius of the ILD calorimeters is larger 
than that for the more compact SiD design. The central, 
solenoidal, magnetic fields have values of 3.5T and 5T 
for ILD and SiD respectively. ILD benefits from greater 
track separation at the entrance to the calorimeter, due 
to the larger radius, while SiD has high precision track¬ 
ing in a limited number of layers in its compact design 
using a larger field. For both ILD and SiD, the calorime¬ 
ter systems are separated into electromagnetic (ECAL) 
and hadronic (HCAL) depth sections. The SiD HCAL 
features a long barrel with a plug forward HCAL, while 
ILD has a shorter barrel HCAL with endcaps. In both 


concepts the iron flux return yoke is foreseen to be instru¬ 
mented for muon tracking and to act as a tail catcher to 
the calorimeters. 

The calorimeter implementations for use with a PFA 
are highly granular in nature - as required to record very 
detailed images of showers for subsequent identification 
and separation of energy deposits, and the measurement 
of the energies of neutral particles. Early studies showed 
that for small enough cell sizes there is an approximately 
linear relation between the number of calorimeter cells 
recording a hit for a shower and the energy of the par- 
ticle(s) causing the shower (Ammosov et al 2004). At 
very high hit densities saturation effects occur and cor¬ 
rections need to be made. Several variations on this ap¬ 
proach have been developed, digital and semi-digital, in 
addition to the more conventional analog approach. The 
fully digital method applies a predetermined threshold to 
each cell as the data is taken. The semi-digital method is 
the same except that it allows for more than one thresh¬ 
old to be applied. The analog approach records cell signal 
magnitudes and stores the information for offline recon¬ 
struction. In the digital methods based on hit counting 
the energy resolution depends on the longitudinal and 
transverse granularity, while in the analogue case it de¬ 
pends on the longitudinal sampling only. Therefore the 
digital methods require smaller cell sizes. 

The ECAL serves to identify electrons and photons 
and to measure photon energies. Approximately 60% 
of hadrons interact in the ECAL featuring a depth of 
about one interaction length. In the ECAL overlapping 
photon-hadron energy deposits need to be disentangled 
and electron-hadron charged tracks need to be separated. 
To minimise overlaps, electromagnetic showers must be 
confined as much as possible, favouring absorber mate¬ 
rials with a small Moliere radius. Additionally, to facili¬ 
tate the separation and identification of electromagnetic 
and hadronic showers, it is helpful to have a large ra¬ 
tio between the interaction length and radiation length. 
Distinguishing electromagnetic showers is facilitated by 
using a high transverse granularity, and fine depth seg¬ 
mentation with many layers. This latter feature also as¬ 
sists with following charged particles through the ECAL 
to the HCAL. 


The ECAL’s for ILD and SiD have tungsten as the 
absorber. Tungsten has a Moliere radius of 9.7mm sat¬ 
isfying the requirement to narrowly confine electromag¬ 
netic showers. Silicon is the material of choice for the 
ECAL active layers, as such layers are easily segmented, 
although there is an alternative design for the ILD ECAL 
that uses orthogonal scintillator strip layers. The com¬ 
bination of silicon sensor layers and tungsten absorber 
allows for compact active layer designs and for division 
into small (0(5 mm)) cells in the transverse plane. Lon¬ 
gitudinally the ILD and SiD ECAL’s have 30 layers, with, 
for instance, SiD having the first 20 layers with 2.5 mm 
tungsten thickness and 1.25 mm readout gap, and the 
































































































last 10 layers having 5.00 mm tungsten - a compromise 
between cost, the sampling frequency, and the contain¬ 
ment of showers. Providing full information for a particle 
flow algorithm demands that every cell in every layer is 
read out. 

Two different designs have been developed and proto¬ 
types constructed for ECAL’s using silicon and tungsten, 
one by CALICE and a second independently by SiD. 

The SiD ECAL design is based on a self-supporting 
structure of tungsten plates interconnected with screw 
and insert-spacer assemblies. Wedge-shaped modules, for 
the barrel ECAL, are assembled by stacking alternating 
layers of tungsten and silicon - with the sensor layers per¬ 
manently installed in the structure. The current baseline 
design of the sensor layers is a tiling scheme using 15 
cm hexagonal sensors, each subdivided into 1024 hexag¬ 
onal pixels of 13 mm^ area. All pixels are connected to a 
single KPiX 1024-channel ASIC mounted directly in the 
center of the sensor. The digitised data are read out via 
Kapton flex cables, which also carry power. The overall 
power requirement is significantly reduced by employing 
power pulsing, a central feature of ILC detectors made 
feasible by the long interval between bunch trains. A 
view of the layer structure for the SiD ECal is shown in 

Fig.0 


integrated into the layer structure. In the ILD baseline 
the ASICs will be bonded onto a very thin multilayer 
printed circuit board that is part of the assembly that is 
inserted into the alveolar structure. The SKIROC ASICs 
as well as others of the same ’family’ mentioned in this 
section combine signal amplification, shaping, triggering 
and digitisation. The ASICs can be power pulsed and 
address thus all aspects needed for an experiment at the 
LC. In parts their performance are subject of this review. 



FIG. 8 Prototype ILD Silicon-Tungs ten Electromagnetic 
Calorimeter. From (Behnke et al. 2013) 
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FIG. 7 The layer structure of the SiD Silicon-Tungsten Elec¬ 
tromagnetic Calorimeter. 

An alternative for the ECAL is based on Monolithic 
Active Pixel Sensors with 50 micron x 50 micron sili¬ 
con pixels. This is a digital approach to electromagnetic 
calorimetry (DECAL). Such sensors could be manufac¬ 
tured in a commercial mixed-mode CMOS process using 
standard 300 mm wafers. Eirst-generation DECAL sen¬ 
sors with 168 X 168 pixels have been manufactured and 
tested. The sensor supports single-bunch time stamping 
with up to 13 bits and power-pulsing. 

The design retained so far by ILD is pursued by the 
CALICE collaboration. It uses an alveolar structure of 
carbon fibre, into which are inserted bare tungsten plates 
and tungsten plates carrying sensor, readout elements 
and control services on both sides. The silicon sensors 
are segmented into square pads of 5 mm size. A view of 
a prototype for this structure is shown in Eigurej^ The 
front-end electronics for this calorimeter are provided by 
ASICs, called SKIROC2 dCallier et aQ |rt| ) that are 


An alternative CALICE ECAL design, to reduce the 
channel count and cost, uses 5 mm wide and 45 mm long 
orthogonal scintillator strips with silicon photo-multiplier 
(SiPM) readout in alternating layers between tungsten 
absorber plates. The readout electronics is provided by 
ASICs adapted to the readout of silicon photomultipliers. 


SPIROC2 in this case (Conforti Di Lorenzo et al. 2013) 


bonded to a thin printed circuit board. 

The HCAL must allow efficient separation and identifi¬ 
cation of energy deposits from charged particles, and pro¬ 
vide an adequate measurement of the energies of neutral 
hadrons. These requirements again argue for fine trans¬ 
verse and longitudinal segmentation. The transverse seg¬ 
mentation should be small compared to the typical size 
of a hadronic shower, while longitudinally there should 
be a large number of layers for shower pattern recog¬ 
nition, while being consistent with sufficient thickness 
to contain a high fraction of the energies of the most 
energetic particles/jets. The whole calorimeter system 
must be contained within the inner radius of the central 
solenoid, whose size is limited by achievable technology 
and cost, and whose thickness corresponds to about 1.5 
nuclear interaction lengths. The possibility exists to use 
the first layers of the muon system, radially outside the 
coil, as a tail-catcher, to identify and measure the energy 
of any small components of hadronic showers that prop¬ 
agate through the calorimeters and coil (for the barrel 
section of the calorimeter). 

The analog HCAL uses layers of small scintillator tiles 
and iron or tungsten absorber. Each small tile, with 3 



































9 


cm X 3 cm size, has an embedded wavelength shifting fi¬ 
bre which is coupled to a SiPM. Studies have shown that 
there is little to be gained from using a cell size smaller 
than this - see Fig. The scintillator tiles provide en¬ 
ergy and position resolution and have been shown to give 
a uniform response across layers and high efficiency for 
minimum ionising particles (MIPs) - essential ingredients 
for a successful particle flow calorimeter. Ongoing work 
has indicated that it may be possible to have the SiPMs 
directly coupled to the scintillator tiles with the latter be¬ 
ing shaped to achieve good uniformity of response across 


their faces (Abu-Ajamieh et al 2011 Blazey et al 2009 


Simon and Soldner 2010). The analog HCAL for ILD 
is divided into 48 longitudinal layers, corresponding to 
six interaction lengths to retain good calorimeter perfor¬ 
mance up to 1 TeV center-of-mass energy. The front-end 


readout electronics consists of SPIROC ASICs (Conforti 


Di Lorenzo et al. 2013) mounted on a printed circuit 


board and connected to the SiPMs carried with the scin¬ 
tillating tiles, with the complete assembly being the ac¬ 
tive layer between absorber plates. Interface boards for 
signal, calibration and power are located at the end faces 
of AHCAL modules. 


a pad is counted as either hit or not hit depending on 
the result of a comparison between the signal size and a 
pre-determined threshold, downloaded to the front-end 
electronics. The 1 cm^ pad size was chosen to mitigate 
the effects of saturation on energy resolution. A pad 
multiplicity of less than two has been achieved for a hit 
efficiency of 95%. The ASIC used has been the DCAL 
chip developed specifically for this purpose. This glass 
RPC design is the baseline choice for the SiD HCAL sys¬ 
tem. In recent developments, a one-glass (the one with 
the resistive paint) RPC has been tested and offers the 
possibility of a thinner active layer, together with a hit 
multiplicity close to one independent of hit efficiency. 



FIG. 10 A schematic of the layer structure of the RPC Digital 
Hadron Calorimeter 



HCAL Cell Size/cm 


FIG. 9 Simulated jet energy resolution of the ILD detecto r 
as a function of the AHCAL cell size. From (Thomson 2009). 


In contrast to the scintillator-based HCAL, a number 
of gas-based calorimeters have been proposed. Two vari¬ 
ations of calorimeters using Resistive Plate Chambers 
(RPC) have been built and tested. The first is a fully 
digital RPC system with glass plates acting as resistive 
plates. In the original design, two glass plates, coated on 
their outer surfaces with resistive paint, define an active 
layer with a typical thickness of less than 8 mm. High 
voltage is applied to the resistive layers. A charged par¬ 
ticle crossing the gas gap between the plates causes an 
avalanche which induces a signal on readout pads adja¬ 
cent to the plates. A schematic of the layer structure 
is shown in Fig. 10 A pad size of 1 cm^ is used and 


A multiple threshold variation of the glass RPC design 
for the ILD HCAL has also been developed. This semi¬ 
digital RPC HCAL, called SDHCAL hereafter, uses two- 
bit readout, implementing three thresholds. This allows 
mitigation of saturation effects provoked by large energy 
deposits, and the determination of whether one, a few, or 
many particles crossed a given cell - providing additional 
information to the particle flow algorithm. The front- 


end electronics is provided by HARDROC ASICs (Du- 


lucq et al. 2010) mounted on one side of a printed circuit 


board, the other side of which carries the inductive signal 
pickup pads. 

Gas-based HCAL designs have also been developed to 
take advantage of micro-patterned gas detector technol¬ 
ogy, using both micromegas and gas electron multiplier 
(GEM) approaches. In the micromegas design a com¬ 
mercial 20 micron woven mesh separates the 3mm drift 
gap from a 128 micron amplification gap. Signals are 
acquired on 1 cm^ pads on one side of a printed cir¬ 
cuit board, the other side of which contains the readout 
ASIC’s. This approach is proposed as an alternative to 
the RPC semi-digital design and also uses several signal 
thresholds. A view of a large area, assembled micromegas 
digital hadron calorimeter plane is shown in Fig. 

Finally, two approaches using gas-electron-multiplier 
technology (GEM) have been proposed. The first GEM- 
based design uses two layers of GEM foils separated by 
1 mm, a 3 mm drift region in front of the foils, and 1 
mm induction region beyond the foils. As for the mi¬ 
cromegas case, signals are collected on 1 cm^ anode pads 
on a printed circuit board also containing the readout 
ASICs. The ASIC used with this technology has been 
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FIG. 11 A view of a plane of the micromegas-based Digital 
Hadron Calorimeter 


the KPiX 1024-channel analog device, allowing the pos¬ 
sibility of recording the signal level on each pad for later 
processing with offline thresholds. A view of a large GEM 
chamber under construction is shown in Fig. \T^ 



FIG. 12 A plane under assembly for the GEM-based Digital 
Hadron Calorimeter 
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FIG. 13 A thick-GEM structure which provides discharge 
protection. 



ECAL 

HCAL 

Absorber 

tungsten 

stainless steel (ILC) 
tungsten (CLIC barrel) 

Analogue 

silicon, scintillator 

scintillator 

Digital 

MAPS 

RPC, GEM, Micromegas 


TABLE I Absorber materials and read-out technologies con¬ 
sidered for particle flow calorimeters. Those realised in large 
prototypes are printed in bold face. 


CERN and Fermilab facilities. Given the scale of the ef¬ 
fort, this only became possible by maximising the use of 
common infrastructure, such as mechanical devices and 
absorber stacks, a common data acquisition system and a 
family of front end ASICs with common building blocks, 
computing, storage and reconstruction software. In the 
following we present the large prototypes, capable of pro¬ 
viding full shower physics data, and a few smaller set-ups 
for technical tests. 


The second GEM-based design uses thick-GEM 
(THGEM) technology with holes in 400-500 micron thick 
circuit boards. Several single and double THGEM struc¬ 
tures have been developed. Promising solutions for thin 
active layers use a single THGEM which includes a re¬ 
sistive layer to prevent sparking and shield the front-end 
electronics from the effects of any residual discharges. A 
schematic of a possible THGEM structure is shown in 
Fig. [131 


B. The large CALICE beam test prototypes 

The candidate absorber materials and read-out tech¬ 
nologies for highly granular electromagnetic and hadronic 
calorimeters are summarised in Table [H 

Test beam experiments played a key role in establish¬ 
ing these technologies and validating the associated sim¬ 
ulations. A large international effort has been carried out 
in the framework of the CALICE collaboration, where all 
major technologies have been exposed to particle beams, 
and large sample of data has been collected at DESY, 


a. SiW ECAL: The CALICE SiW ECAL group has 


developed a first so-called “physics prototype“ (Repond 


et al. 


2008), shown in Fig, 


14 


whose aim was to demon¬ 
strate the ability of such an ECAL to meet the perfor¬ 
mance requirements. For details of the layout please con¬ 
sult Ref. (Repond et al 2008). Here only the gross fea¬ 
tures are recapitulated. The physics prototype has an 
active area of 18 x 18 cm^ in width and approximately 
20 cm in depth, subdivided longitudinally into 30 layers. 
The layers are composed alternately of W absorber plates 
and a matrix of PIN diode sensors on a silicon wafer sub¬ 
strate. The active part of a layer consists of 3 x 3 silicon 
wafers featuring a matrix of 6 x 6 PIN diodes. Altogether 
the SiW ECAL comprises thus a total of of 9720 1x1 cm^ 
calorimeter cells. Electronic read-out proceeds via an 
18 channel ASIC, FLC_PHY3 (Repond et al 2008) fol¬ 
lowed by an off-detector digitisation and data acquisition 
system (Dauncey, 2002). At normal incidence, the proto¬ 
type has a total depth of 24 Yq achieved using 10 layers 
of 0.4 Yo tungsten absorber plates, followed by 10 layers 
of 0.8 Yo, and another 10 layers of 1.2 Yq thick plates. 
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FIG. 14 The CALICE SiW ECAL physics prototype detector. 


Each layer is subdivided into a central part featuring a 
3x2 array of silicon wafers and a bottom part consisting 
of a 3 X 1 array of silicon wafers. In 2006 the bottom 
part of the first six layers was missing. The detector was 
progressively developed until completion in 2008. 


b. SciW ECAL: The CALICE scintillator strip-based 
ECAL (ScECAL) achieves the required granularity for 
Particle Flow with a scintillator strip structure. Scintil¬ 
lator layers interleaved with absorber plates are placed in 
two alternative orientations, with horizontally and verti¬ 
cally aligned strips, to achieve effectively fine square seg¬ 
mentation. Each strip is individually read out by a Multi 
Pixel Photon Counter (Gomi et al 2007) (MPPC, a sil¬ 
icon photo-multiplier produced by Hamamatsu Photon¬ 
ics) with 1600 pixels. Signals from the MPPCs are read 
out by the same front-end electronics developed for the 
CALICE analog hadron calorimeter prototype (Adloff 


et al 2010a). 


The first ScECAL prototype, consisting of 468 chan¬ 


nels, was constructed and tested at DESY (Francis et al. 


2013). It consisted of 26 pairs of 3 mm thick scintillator 


and 3.5 mm thick absorber layers. The absorber ma¬ 
terial was composed of 82% tungsten, 13% cobalt and 
about 5% carbon. Each scintillator layer consisted of two 
45 X 90 mm^ “mega-strip” structures consisting of nine 
45 X 10 mm^ strips. The total active volume was about 
90 X 90 X 200 mm^. The total thickness of the prototype 
was 17.3 radiation lengths. The mega-strips were pro¬ 
duced by machining holes and grooves in a 3 mm-thick 
Kuraray SCSC38 plastic scintillator plate. Two types of 
detection layers were produced: one with a 1 mm di¬ 
ameter Kuraray Y-11 wavelength shifting (WLS) fibre 
running along the length of the strip (type-F), and the 
other without the WLS fibre or its associated hole (type- 
D).The presence of the WLS fibre improves the response 
uniformity along the strip length. 


Following the experience of the first prototype tested 
at DESY, the physics prototype of ScECAL, with 30 
layers and a transverse size of 180 x 180 mm^ was con¬ 
structed, with a total of 2160 channels. Rather than the 
mega-strips used in the first prototype, the physics pro¬ 
totype used individual small scintillator strips, wrapped 
with reflective foil to increase photon yield and reduce 
optical cross-talk between strips (Fig.^^left). Each ac¬ 
tive layer consisted of 72 scintillator strips with a size of 
45 X 10 X 3 mm^. Each strip, produced by an extrusion 
method, had a hole along its length into which a 1 mm 
diameter Kuraray Y-11 WSF fibre was inserted along the 
strip {Fig.fib\right) . The physics prototype was equipped 
with an improved calibration system, based on LEDs and 
notched clear fibers, to monitor the gain of the MPPCs in 


the physics prototype. It was tested at Fermilab (CAL¬ 


ICE 2012c), mounted in front of the CALICE AHCAL 


physics prototype. 



EIG. 15 Left: An active layer of the SciECAL prototype, 
consisting of 72 scintillator strips, where each strip is wrapped 
with reflective foil having a hole for calibration LED light. 
Right: Schematic view of a scintillator strip with a WLS fibre 
and a MPPC. 


c. Sci Fe and W AHCAL: The scintillator-based ana¬ 
logue HCAL (AHCAL) prototype (Adloff et al 2010a) 
is a sandwich structure made of 38 layers of scintilla¬ 
tor tiles (5 mm thick) interleaved with steel absorber 
plates of 17 mm thickness, which were later replaced by 
10 mm thick tungsten plates (Adloff et al. 2014a). The 
active layers are housed in steel cassettes with two times 
2 mm thick cover plates, which contribute to the absorber 
structure. The total thickness corresponds to 5.3 nuclear 
interaction lengths. The transverse dimensions of the ac¬ 
tive part are 90 x 90 cm^. The tile size is 3 x 3 cm^ in the 
central region; for the outer and rear parts larger sizes 
are used, see Fig. In total there are 7608 tiles, each 
individually read out via a wavelength-shifting fibre by 
a silicon photo-multiplier (SiPM) produced by MEPhI / 
PULSAR in Russia ( Buzhan et aL\ 2003[ ). 

The CALICE AHCAL prototype was the first appli¬ 


cation of SiPMs on a large scale. SiPMs (see (Renker 


and Lorenz 2009) for a review) are pixelated avalanche 


photo-diodes operated in Geiger mode, with a typical 
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FIG. 16 Active layer of the AHCAL prototype, scintillator 
tile with SiPM. 


gain of 10^. The pixels, 1156 on a 1.1 x 1.1 mm^ sur¬ 
face, have individual quenching resistors and operate on 
a common substrate. The output charge signal is pro¬ 
portional to the number pixels fired by photo-electrons 
and measures the light intensity. Electronic read-out pro¬ 
ceeds via an 18 channel ASIC, FLC.SiPM, which is based 
on the FLC_PHY3 chip (Repond et al 2008) used for 
the ECAL prototype and therefore compatible with the 


same digitisation and data acquisition system (Dauncey 


2002). A LED based calibration system was used to mea¬ 


sure the gain of each SiPM, and their temperature was 
monitored by five sensors in each layer. 

Test bench characterisation of SiPMs and tiles plays 
a vital role. For each SiPM, the over-voltage (reverse 
bias voltage excess over breakdown voltage) was adjusted 
to equalise the light yield to about 12 pixels per MIP 
normally traversing a tile, and the non-linear response as 
a function of light intensity was recorded for use in the 
offline reconstruction. 


d. RPC Fe and W DHCAL: The Digital Hadron 
Calorimeter or DHCAL uses Resistive Plate Chambers 
(RPCs) as active elements (Drake et al. 2007). The 
chambers are read out with 1x1 cm^ pads and 1-bit 
(digital) resolution. A small-scale prototype was assem¬ 
bled and tested in the Fermilab test beam in 2007 to 
validate the concept (Bilki et al. 2008 2009a|b|c 2010). 

A large prototype with up to 54 layers and close 
to 500,000 readout channels was built in 2008 - 2011. 
Each layer measured approximately 96 x 96 cm^ and was 
equipped with three chambers, stacked vertically on top 
of each other. 

In the test beams the DHCAL layers were inserted into 
a main stack of 38 or 39 layers, followed by a tail catcher 
with up to 15 layers. For the tests performed at Fermilab 
the main stack contained steel absorber plates (Adloff 


et al. 2010a). At CERN the absorber plates contained 


a tungsten based alloy. In both cases the tail catcher 
featured steel absorber plates. 

In the various test beam campaigns combined, span¬ 
ning the years 2010 - 2012, the DHCAL recorded several 
ten million muon and secondary beam events, where the 
latter contained a mixture of electrons, muons, pions, and 
protons. A photograph of the setup in the CERN test 


beam is shown in Fig. 



FIG. 17 Photograph of the DHGAL setup in the PS test 
beam at GERN. 


e. RPC Fe SDHCAL: Similar to the DHCAL described 
above the CALICE RPC-SDHCAL (Mannai et al. 2013) 
uses glass resistive plate chambers as the sensitive 
medium. By virtue of a pad board a 100 x 100 cm^ large 
RPC chamber is subdivided into cells of 1 x 1 cm^ The 
chambers are coated with a novel mixture of colloidal 
graphite allowing for the application of the silk screen 
print method that ensures a uniform effective surface re- 
sisitivity. The chambers are integrated into a stainless 
cassette with an overall thickness of 11mm that consti¬ 
tutes therefore a part of the absorber medium. This steel 
cassette hosts also the readout components consisting of 
the HARDROC ASICs mounted on PCBs. The steel cas¬ 
settes are inserted into a mechanical structure that can 
host up to 51 cassettes. The mechanical structure fea¬ 
tures stainless steel plates of 1.5 cm thickness as the main 
absorber medium. A layer composed of the cassette and 
an absorber plate has a depth of about 0.12 hadronic in¬ 
teraction lengths A/. For beam tests 50 RPC-chambers 
were built within six months at the beginning of 2012 
and the prototype has been operated with 48 chambers. 
A water cooling system, together with the capabilities of 
the HARDROC ASICs to cycle the power supply syn¬ 
chronously to the duty cycle of the beam, provided an 
important noise reduction. The prototype was commis¬ 
sioned in 2012 and an extensive beam test program has 
been conducted at the PS and SPS beam test facilities of 
CERN. A picture of the prototype is shown in Fig. [TM 


f. Alternative SDHCAL technology: A 1 m^ mi- 
cromegas chamber comprises 9216 cells. The PCBs and 
the readout ASICs are integrated into the gas volume. 
This prevents a large energy deposition in the mesh which 
in turns protects the ASICs from being damaged in case 
of a spark. In total 4 of these chambers were built and 
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FIG. 18 CALICE’S Semi-Digital Hadronic Calorimeter pro¬ 
totype during the April 2012 test beam at CERN. 


operated in beam tests in 2012. They were tested in 
standalone mode as well as interleaved with the RPC 
chambers described before. The latter configuration al¬ 
lowed for the study of the response to hadronic showers 
without a large number of chambers. 


g. Sci and RPC Fe TCMT: Hadron test beam set-ups 
for runs at higher energies have been complemented by a 


tail-catcher muon-tracker (TCMT) system (Adloff et al 


2012b). The TCMT steel structure consists of two sec¬ 


tions, a first one with nine 21 mm thick absorbers, and 
a second, coarser one with 104 mm thick absorbers. The 
plates are interleaved with, in total, 16 read-out layers, 
each made of 20 scintillator strips, 5 cm wide and ar¬ 
ranged in alternating horizontal and vertical orientation. 
The strips are read out via wavelength-shifting fibres 
coupled to SiPMs and use the same electronics as the 
AHCAL. In most DHCAL runs, the TCMT was instru¬ 
mented with RPCs. 


C. Test beam overview 

Following commissioning runs and initial tests with 
electrons at DESY, the major CALICE test beam cam¬ 
paign started with a combination of silicon ECAL and 
scintillator AHCAL, plus TCMT, at the CERN SPS in 
2006-2007, shown in Fig.[^ Until 2012, all major ECAL 
and HCAL readout-out technologies and HCAL absorber 
materials were tested in different combinations at Fermi- 
lab and CERN, see Table [Ilj At both sites, data were 
also taken with the HCAL in stand-alone mode, without 
ECAL in front, to validate the performance with elec¬ 
trons and low energy hadrons. 

Altogether, more than half a billion physics events have 
been collected, not including calibration data recorded 
with muons or optical signals. Data management and 



EIG. 19 CALICE test beam set-up with ECAL, HCAL and 
TCMT at the CERN SPS. 


processing has been based on tools developed for the LDC 
grid. For this the virtual organisation calice has been 
created that is hosted by DESY. 


III. CALORIMETER PERFORMANCE IN TEST BEAMS 

The technologies proposed for the realisation of highly 
granular calorimeters had not been used before in 
calorimeters. SiPMs were completely novel, silicon pad 
diodes had been used in smaller devices only, RPCs, 
GEMs and micromegas had typically been combined with 
strip read-out only, or did not cover large areas. So the 
first question to ask was whether these devices function at 
system level, can be operated reliably, and deliver the ex¬ 
pected calorimetric performance in test beams. Second, 
in order to validate the simulations of hadronic show¬ 
ers and arrive at conclusions on the adequacy of the un¬ 
derlying physics models, the new detectors must be un¬ 
derstood in terms of simulations. Since the evolution 
of electromagnetic particles can be predicted with much 
higher precision, the response of the prototypes to elec¬ 
trons or positrons is first used to quantitatively validate 
the detector modelling of the new prototypes. A special 
issue common to all technologies is the robustness of the 
embedded read-out chips with respect to possible shower 
induced malfunctions, which was the subject of dedicated 
studies. 


A. CALICE silicon tungsten ECAL 


Results presented in this review are based on large 
statistics samples of electrons, pions and muons recorded 
at CERN and FNAL between 2006 and 2011. A detailed 
overview of the beam test performance of the physics pro¬ 


totype is given in Refs. (Repond et al, 2008) and (Adloff 


et al. 2009). The detector noise is reported to be 13% of 


a MIP. This small value enables single MIP detection in 
reconstruction algorithms for Particle Flow. The noise 
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TABLE II Summary of CALICE test beam periods. 


Year 

Beam 

W EGAL read-out 

HGAL read-out 

HGAL absorber 

Fe Tail catcher read-out 

2006-07 

GERN SPS 

Silicon pads 

Scint. analogue 

Fe 

Scint. 

2008 

FNAL FTBF 

Silicon pads 

Scint. analogue 

Fe 

Scint 

2008-09 

FNAL FTBF 

Scint. strips 

Scint. analogue 

Fe 

Scint. 

2010 

FNAL FTBF 

- 

RPG digital 

Fe 

Scint. 


GERN PS 

- 

Scint. analogue 

W 

- 

2011 

FNAL FTBF 

Silicon pads 

RPG digital 

Fe 

RPG 

2011 

GERN SPS 

- 

Scint. analogue 

W 

Scint. 

2011-12 

GERN PS, SPS 

- 

RPG semi-digital 

Fe 

- 

2012 

GERN PS, SPS 

- 

RPG digital 

W 

RPG 


and the response to minimum ionising particles are uni¬ 
form throughout the detector as demonstrated by Fig.[^ 
In (Repond et al 2008) issues with coherent noise are re- 



Pad index 


EIG. 20 Top: Detector noise of the CALICE SiW ECAL 
physics prototype as measured in the 2006 beam test cam¬ 
paign at CERN as a function of the cell number. Bottom: The 
most probable value of the response to minimal ionising par¬ 
ticles as a function of cell number. The different colours dis¬ 
tinguish wafers from different vendors. The inset histograms 


display the projections on the y-axis. Both from (Repond 


et al 2008). 


ported. During the beam test operation, in particular for 
highly energetic electrons, a square pattern on the wafer 
periphery has been observed. The reason for that is a 
capacitive coupling of the guard ring that surrounds the 
silicon wafer with the actual silicon pads. The observa¬ 
tion of these ’square events’ triggered R&D on the guard 
ring design. The studies conclude that the frequency of 
square events can be largely reduced by a segmentation 


of the guard rings (Cornat 2009). 

A detailed analysis of the response of the physics pro¬ 
totype to electrons is published in (Adloff et al. 2009). 
The analysis selects events impinging on the detector rea¬ 
sonably far from the boundaries of the silicon wafers. In 
addition a correction procedure has been developed for 
residual losses in the gaps between the wafers. Finally, 
the total energy deposit is calculated according to 


Erec(MIPs) = ^«;iEi (2) 

i 


The sum runs over the thirty layers of the prototope and 
Ei is the energy deposition in one layer. The weight¬ 
ing factors are given by Wi = Ki rji. The value 
Ki = 1, 2, 3 reflects the varying sampling fraction due to 
the increase of the absorber material in the three mod¬ 


ules, see page[^ The value r]i corrects for effects of the 
internal structure of the SiW ECAL. The correction is 


maximally 7%. Based on this weighting. Fig. 21 shows 
the response of the prototype to an electron beam with 
an energy of 30 GeV for data and Monte Carlo simula¬ 
tion. Both agree well apart from the tail on the left hand 
side of the peak. The disagreement is attributed to resid¬ 
ual radiative effects (i.e. bremsstrahlung in the beam line 
material), residual pion background and maybe also en¬ 
ergy losses in the space between the wafers. The resulting 
distribution can be well fitted by a Gaussian in the range 
[—Icr, 2cr] around the mean value Emean- The linearity 
and energy resolution of the detector is given in Fig. 

The linearity is approximated by a fit of the form 
Emean = I^Eijeam “ OL. The parameter is a simpie con¬ 
version factor from the beam energy in GeV to the scaie 
of the detector response in units of MIP. The parameter 


a parameterises an offset that according to Ref. (Adioff 


et al. 2009) is attributed to iosses of information, i.e. 


energy depositions, that are discarded due to the noise 
cut in the anaiysis of 0.6 MIP. It is found that between 
6 and 45 GeV the detector response is iinear within 1%. 
In this energy range the energy resoiution is determined 
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FIG. 21 Response of the CALICE SiW ECAL to a electron 
beam with an energy of 30 GeV. Erom (Adloff et al. 2009). 


to be 


r(^„ 


(16.53 ± 0.14(stat) ± 0.4(syst))% 


-S'meas VEiGeV) 

© (1.07 ± 0.07(stat) ± O.l(syst)) %, (3) 

The resulting energy resolution is clearly inferior to 
that obtained for other calorimeter technologies, e.g. for 
crystals. It should however be stressed at this point that 
the detector design emphasises granularity, and thus fine 
sampling, over the pure calorimetric response. 

The high granularity of the SiW ECAL leads naturally 
to a rich amount of information which can be exploited 
using advanced, e.g. imaging processing techniques. One 


of these techniques is the Hough transformation (Hough 


1962). The Hough transformation provides a mapping 


from a n-dimensional feature space onto a m-dimensional 
parameter space, also called Hough space. Briefly, points 
which are on a straight trajectory as e.g. generated by 
a MIP, will all result in the same parameter set in the 
Hough space. Thus, by subjecting the calorimeter cells 
which carry energy at or above one MIP to a Hough 
transformation, a MIP trajectory will lead to an accu¬ 
mulation at a given set of parameters. 

A result of a simulation study for muons overlaid on 


30 GeV electrons carried out in (CALICE 2010) is shown 


in Fig.[^ It demonstrates that a full separation of close- 
by particles can be achieved for distances down to 2.5 cm. 


The feasibility of having embedded readout electronics 
for a calorimeter proposed for a future lepton collider has 


been studied in (Adloff et al. 2011a). A detailed analy¬ 


sis of noise spectra of the ASICs exposed to high-energy 
electron beams has revealed no evidence that the noise 
pattern is altered under the influence of the electromag¬ 
netic showers. The probability to have fake signals above 
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EIG. 22 Top: Linearity of the energy response of the CAL¬ 
ICE SiW ECAL as a function of the nominal energy. The line 
represents a fit of the form pEbeam — ot to the measured energy 
Emean- Parameters a and /3 are explained in the text. Bot¬ 
tom: Relative energy resolution (cr(Emeas)/^meas) as a func¬ 
tion of 1/VEbeam (solid squarcs), and its usual parameter- 
isation as s/ y/E 0 c. The values expected from simulation 
are shown (open squares). The dashed line gives the fitted 
resolution for data (Equation]^, and the dotted lines corre¬ 
spond to its variation when the beam energy s cale is shifted 
by ± 300 MeV. Both from ( [Adloff et al. 2009). 


the MIP level is estimated to be smaller than 6.7 • 10“^ 
per shower. The probability for a fake signal is less than 
10“^ for a threshold of 2/3 of a MIP. For an event of the 
type e+e“ ^ tt at ^/s = 500 GeV at a lepton collider 
about 2500 cells of dimension 1x1 cm^ are expected to 
carry a signal above noise level which is typically defined 
to be (60-70)% of a MIP. The resuits thus reveaied no 
probiems for the design of embedded readout eiectronics 
for a detector for a iepton coiiider. 







































16 



30 40 50 60 

distance MIP shower axis [mm] 


FIG. 23 Simulated MIP detection efficiency in the CALICE 
SiW ECAL as a function of the distance between the shower 


axes of an el ectron and of a muon. Erom (CALICE 2010 
Poschl 2011). 


been studied and is seen to follow the usual distribution. 
However, significant crosstalk problems were discovered 
and are being investigated. 



EIG. 25 A single electron shower event in the SiD ECAL 
prototype stack. 


B. SiD silicon tungsten ECAL 


C. Scintillator tungsten ECAL 


In order to demonstrate the feasibility of assembling a 
highly compact electromagnetic calorimeter, with printed 
circuit boards and with direct bonding of chips to wafers, 
a first prototype stack for an SiD ECAL has been con¬ 
structed. A section of this SiD ECAL with KPiX read¬ 
out has been exposed to a 12.1 GeV electron beam at 
the SLAC ESTB facility. A schematic of the test setup 
is shown in Fig. This agressive design has an active 
gap between absorber plates of 1.25 mm, a cell size of 
13 mm^, and an effective Moliere radius of 14 mm. 



FIG. 24 Gonhguration of 15 cm size hexagonal sensor planes, 
each read by one KPix chip placed at the centre and inter¬ 
leaved between tungsten plates for the beam test of the SiD 
EGAL prototype. 

In the first tests, a stack of nine silicon sensor planes 
and eight tungsten plates (corresponding to six radiation 
lengths) was exposed to beam. The full stack will ulti¬ 
mately consist of 30 layers as in the SiD ECAL design. 

Data was taken over a four-day period with a beam 
rate between 0.5 and 5 electrons per pulse. Fig. shows 
an example of a single electron shower in the test stack. 
The development of the shower, and the transverse dis¬ 
tribution of digital hits at each sensor plane, are clearly 
visible. The longitudinal profile of electron showers has 


The first, small ScECAL prototype with about 500 
channels was tested in 2007 using positron beams with 
energies from 1 to 6 GeV provided by the DESY-II 
electron synchrotron (Francis et al. 2013). The aim 


was to demonstrate the feasibility of a scintillator strip 
ECAL with MPPC readout. Large-scale tests of Hama¬ 
matsu MPPCs in a real detector had not yet been per¬ 
formed.The test served to establish calibration and cor¬ 
rection procedures for this novel type of photo-sensor, 
which were used in later ECAL prototypes, too. They 
follow the principles developed for the SiPMs of the AH- 
CAL, which are described in the next section. 

Calibration runs were performed several times during 
the beam time using a 3 GeV positron beam, where the 
absorber plates were removed from the detector. Events 
consistent with a single, non-showering particle passing 
through the prototype ScECAL were selected and used 
for the calibration. Each strip was then individually cal¬ 
ibrated with data for which the reconstruction of a track 
recorded in a drift chamber installed upstream showed 
that a particle passed through the strip. 

The MPPC signal is intrinsically non-linear due to its 
finite number of pixels (1600), which leads to a saturation 
of its response at high light intensities. If an input light 
pulse is shorter than the MPPC recovery time (^ 4 ns 
for MPPCs used in the prototype), the MPPC response 
can be parameterised by 


-^fired (-^p.e.) — -^pix(l ^ 


_ p-Np.e./N^: 


0 , 


(4) 


where V^red denotes the number of fired pixels, Vpix 
the total number of MPPC pixels, and Vp.e. the num¬ 
ber of photoelectrons created. If the input light pulse is 
longer than the MPPC recovery time, the effective dy¬ 
namic range is increased due to the possibility of a sin¬ 
gle pixel firing several times within the same light pulse. 
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This effect occurs particularly for the strips using WLS 
fibres of decay time ^8 ns. In contrast, directly read¬ 
out strips are not expected to have such enhancement 
because of the shorter decay time of the scintillator it¬ 
self (^2 ns), The single pixel signal of each MPPC was 
obtained from the spectrum measured in LED calibra¬ 
tion runs, so that A^fired could be calculated from the 
measured ADC counts. The response of the two types 
of scintillator strip-MPPC systems was measured using 
a dedicated apparatus using an ultraviolet LED to in¬ 
ject light into the strip and a photomultiplier to moni¬ 
tor the light yield, and the measured response functions 
were used to correct the saturation effect. To reduce 
these effects, MPPCs with more pixels 10000 pixels 
in an effective area of 1 mm^) have been developed in the 
meantime. 

Since the MPPC gain G depends on the temperature T 
{Ct = ^GjdT ^ 2%/K at 20°C), correction of this effect 
is essential. The dependence of each channel’s response 
on temperature A(T) was fitted with a linear function 

A{T) = A{To) -{l + CT-iT- To)) (5) 

where the reference temperature Tq was chosen to be 
20° C. In the analysis of the positron events the response 
of each strip was calibrated using the temperature- 
dependent calibration function determined by these fitted 
functions. 

Although plastic film was inserted into the pair of 
grooves between strips, adjacent strips in the same mega¬ 
strip were not perfectly optically isolated. The cross-talk 
between neighbouring strips was typically around 10%, 
with a relative variation of around 15% (RMS). Since 
the MIP calibration was defined without accounting for 
cross-talk, a simple sum over measured strip energies 
would give an overestimate of the deposited energy in 
terms of MIPs. A correction procedure was applied to 
estimate the cross-talk contribution from the amplitude 
recorded in adjacent strips and subtract it from the signal 
in each strip. 

Eig.[^ shows the measured energy spectra of 1 - 6 GeV 
positron events collected in the central region of the pro¬ 
totype for a detector configuration (E-D configuration), 
where the type-E module with WLS fibres was directly 
upstream of the type-D module without WLS fibres. 

The successful operation of several hundred MPPCs 
demonstrates that such a technique is feasible and rep¬ 
resents an important milestone in the development of 
a prototype scintillator strip-based ECAL. The applied 
temperature-based corrections to the MPPC response 
successfully stabilised the prototype’s response. The en¬ 
ergy response of this calorimeter prototype was measured 
to be linear to within 1% in the energy range between 1 
and 6 GeV. The stochastic terms in the various configu¬ 
rations and regions were measured to be between 13 and 
14%, while the measured constant terms are between 3 
and 4.5%. Depending on the true beam energy spread. 



FIG. 26 The measured energy spectra of 1-6 GeV positron 
events collected in the central region for a detector configu¬ 
ration (F-D configuration) in which the module with WLS 
fibres was d irectly upstream of the module without WLS fi¬ 
bres. From ( Francis et aL\^0T3 ). 


the intrinsic calorimeter performance may be better than 
this. The measured constant term is rather large and a 
simulation study shows that it has contributions from 
non-uniformity of the strip response and shower energy 
leakage due to the limited prototype size, as well as insen¬ 
sitive volume due to the MPPC package. It is expected 
to be reduced in a larger detector with less leakage, and 
strips with better uniformity 


The second physics prototype (CALICE 2012c) was 
built using individual small scintillator strips. The pro¬ 
totype was transversely twice as large as the first pro¬ 
totype, and the number of layers was increased from 26 
to 30. The physics prototype was explored with various 
types of beams: electrons up to 32 GeV to study the re¬ 
sponse to electromagnetic events, 32 GeV muons for the 
calibration, and charged pions of up to 32 GeV to study 
the hadron response in the combination with the analog 
hadron calorimeter (AHCAL) and the tail catcher muon 
tracker (TCMT).These beams were provided at the Fer- 
milab test beam facility in 2008 and 2009. 


Electron events collected in the central region (8cm x 
8cm) of the prototype were used to evaluate the linear¬ 
ity and resolution of the measured energy. Eig. 27 shows 
the deposited energy as a function of the momentum of 
incident beams. The solid line is the result of a linear fit 
to the data. The maximum deviation from linearity is 
1.6% at 20 GeV. Eig. ^ shows the energy resolution as a 
function of the inverse of the square root of the incident 
beam momentum. The intrinsic beam momentum fluc¬ 
tuation is estimated to be 2.7 ± 0.3% for 2-4 GeV and 
2.3 ± 0.3% for 8-32 GeV, respectively, and is quadrat- 
ically subtracted from the resolution. The curve shows 
the result of a fit to the data with a quadratic parametri- 
sation of the resolution. The intrinsic calorimeter reso¬ 
lution for electron beams, after subtraction of the beam 
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FIG. 27 Response of the ScECAL physics prototype to 2 - 
32 GeV electrons (top), and the deviation from the result o f 
a linear fit (bottom). From (GALIGE 2014 Uozumi 2014). 



FIG. 28 Energy resolution of the ScEGAL physics prototype 
to 2 - 32 GeV electrons as a func tion of inverse of square 
root of the beam momentum. From (|GALIGE[ |2014| |Uozumi 

Mil. 


momentum spread, was determined to be 

12.8 i 0.1[stat.) ± 0A[syst.) 


a/E = 


Ve 

1.0 ± 0.1{stat.)'^i'Q{syst.) 


The test beam results show that the physics prototype 
satisfies the requirements on the linearity and resolution 
of its electromagnetic response. 


D. Scintillator steel or tungsten AHCAL 


Following successful operation of a small tile calorime¬ 
ter with SiPMs (Andreev et al. 2005), the AHCAL pro¬ 


totype was the first device to use SiPMs on a large 
scale, with 7608 channels in total. Over seven years 
of test beam operation, and numerous transports be¬ 
tween DESY, CERN and Fermilab, the robustness of 
the technology was convincingly demonstrated. Since 
then, SiPMs have been adopted by several high energy 


physics experiments, e.g. Belle II (Abe et al. 2010a) 


and CMS (Lutz 2012) for the read-out of scintillators. 


and moreover they have conquered a broad range of ap¬ 
plications, where they replace classical vacuum photo¬ 
multiplier tubes, e.g. in medical imaging. 

Using noise data recorded over several years, the long¬ 
term stability of the novel photo-sensors was studied in 
great detail (Adloff et al. 2010a), and no sign of ageing 
has been found. There was a fraction of 2% of dead 
channels, due to initial bad soldering, which increased by 
0.5% after the transport from CERN to Fermilab. 


The response of each detector cell is calibrated (Adloff 
et al. 2011b| ) using the signal of MIPs, see Fig. [29j 


f;[mip] = a/Amip • /(^Mpixei) 


(6) 


where E denotes the visible energy in units of MIPs, and 
A the detector response measured in ADC counts. The 
MIP scale Amip is set by the most probable value of the 
pulse height spectrum for MIPs. The correction function 
/ accounts for the exponential saturation of the SiPM 
response (Eq. [^, which is due to the finite number of 
pixels and finite sensor recovery time, and linearises the 
response at cell level. The correction factor is equal to 
one for small amplitudes and depends only on the frac¬ 
tion of the sensor pixels fired. Its argument is the ampli¬ 
tude normalised to the pixel scale Apixei, or gain, which 
is extracted from the separation of single photo-electron 
peaks in the pulse height spectrum for small, LED in¬ 
duced, amplitudes; see Fig. 

The function / had been determined on the test bench 
for each SiPM individually, before mounting it on the tile. 
Measurements performed in situ with large LED signals 
indicated that the saturation level was 20% lower than 
on the test bench, which was confirmed by laboratory 
studies with assembled tile SiPM systems and traced to 
the fact that the SiPM surface was only partially illumi¬ 
nated by the fibre. The effect was corrected using an av¬ 
erage scale factor on the SiPM response curves, and must 
be accounted for in future test bench procedures. SiPM 
parameters, such as gain or efficiency of the Geiger dis¬ 
charge, depend on the over-voltage, and the excess of bias 
over break-down voltage, AV{T) = Ubias-Pbreakdown(T). 
The latter increases with temperature T, for the SiPMs 
of the AHCAL prototype by about 50 mV/K, and AU 
is about 2.5 V. Therefore both Apixei and Amip depend 
on temperature, with coefficients of about —2%/K and 
—4%/K, respectively, which is corrected for using the 
form of Eq. and the temperature recorded in each ac¬ 
tive layer during data taking (Adloff et al. 2014a). After 


correction, the residual dependence is at the level of a 
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FIG. 29 MIP distribution of muons with super-imposed fit 
and noise spectrum for reference (top), distribution of sin¬ 
gle pi xel peaks measured in gain calibration (bottom). Both 
from (Adloff et al. 2010a). 


few per-mil, see Fig. In principle, it is possible to 
actively compensate this dependence by adjusting Fbias 
according to T. 
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FIG. 30 Temperature gradient of MIP signa ls before and 
after correction, for each AHGAL layer. From (Adloff et al.\ 


2014a). 


The light yield of a scintillator tile traversed by a MIP 
corresponds to 13 fired pixels, on average, in the SiPM. 
In the test beam prototype, events were triggered by an 
external scintillator, and each channel was read out. In 
subsequent reconstruction, only signals with an ampli¬ 
tude above a threshold corresponding to 0.5 • Amip were 
retained. The efficiency of this noise cut for MIPs is 
90-95%. Above this threshold, the noise hit occupancy 
was about 2 • 10“^, and the summed noise amplitude 


corresponded to a few hundred MeV. These values, and 
the fine granularity, result in excellent imaging capabil¬ 
ities, which reveal the sub-structure of hadronic show¬ 
ers, see Fig. For example, tracks of charged parti¬ 
cles are clearly visible.These are subject to quantitative 
study later in this article. With more recent SiPMs, the 
noise occupancy is expected to decrease by one to two 
orders of magnitude, due to lower dark rates and steeper 
decrease of rate with threshold, thanks to suppression 
of inter-pixel cross-talk. In general, noise was not sub¬ 
tracted from data. Instead, for comparison with simu¬ 
lations, noise events recorded with random triggers were 
superimposed on the simulated events, before applying 
the threshold. 



FIG. 31 Event display of a 20 GeV pion taken from the 
online monitor. The beam enters from the right, the parti¬ 
cle traverses the EGAL, interacts in the HGAL and prod uces 
signals in the TGMT behind. Erom (Adloff et al. 2010a). 


The simulation of the detector response, so-called digi¬ 
tisation, takes the cell-to-cell variation of the calibration, 
the photo-electron statistics, SiPM saturation, electronic 
noise and optical crosstalk between tiles into account. 
The effects of non-uniformities in the light response over 
the area of a tile have been implemented for a dedi¬ 
cated study (Sefkow and Lucaci-Timoce 2010). There 
is a 100 jim wide zone along the edges of the tile where 
no light is produced, and a reduction of light yield at the 
positions of fibre and SiPM, due to reduced scintillator 
material thickness there. The effects on the response to 
electrons and pions were found to be negligible, which 


was confirmed for hadrons with test beam data (CAL¬ 


ICE 2013g). Thus, to save computing time, they are 


not simulated by default. The simulated events are pro¬ 
cessed through the same calibration and reconstruction 
chain as real data. The scale of the simulated response 
is adjusted by one global parameter to the MIP scale; no 
further tuning to data is applied. 

For the validation of the detector calibration and sim¬ 
ulation, positron induced showers recorded at the CERN 
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SPS test beam have been analysed, in an energy range 


from 10 to 50 GeV (Adloff et al 2011b). The sampling 


structure of the AHCAL corresponds to 1.24 radiation 
lengths Xq per layer and has an effective Moliere radius 
Rm of 2.47 cm. In order to minimise the noise con¬ 
tribution, the shower energy was summed up over cells 
within a cylinder of 5 Rm around the extrapolated track 
and over a length of 20 layers. The distribution, for a 
given beam momentum, is fitted by a Gaussian, with 
the peak position taken as the mean response, and the 
width as resolution. An electromagnetic energy scale fac¬ 
tor 42.3T0.4 MIP/GeV is extracted from a linear fit from 
0 to 50 GeV to the distribution of reconstructed versus 
beam energy. The reconstructed energy on the electro¬ 
magnetic scale as a function of beam energy is plotted in 
Fig. Systematic errors, indicated by the green band, 
are dominated by MIP calibration uncertainties, and, at 
the highest energies, by those on the saturation correc¬ 
tion. The deviation from linearity is less than 1% in the 
range 10 to 30 GeV, and 3% at 50 GeV. This indicates 
imperfections in the saturation correction, which in the 
future will be remedied with SiPMs with a larger dy¬ 
namic range, and with a more precise test bench charac¬ 
terisation. For the study of hadron shower development, 
this is sufficient, since for a given beam energy the single 
hit energy spectrum is much softer for hadrons than for 
electrons. 



FIG. 32 Linearity of the AHCAL response to positrons. 
Black dots represent data, blue triangles data before satu¬ 
ration correction, and open triangles show simulation results. 
The dashed line shows exact linearity for reference. The green 
band represents the systematic uncertainty. From (Adloff 


et al. 2011b). 


The resolution for electromagnetic showers is shown 
in Fig. 33 It agrees with that of a previous proto¬ 


type (Andreev et al, 2005), and the data together with 
the simulation, over an energy range from 1 to 50 GeV. 


The AHGAL resolution data is fitted to the function 


(Te _ 

^ ~ 71 


c 

E ‘ 


(7) 


The last term, representing the noise contribution, is 
fixed using random trigger events, to 58 MeV. The first, 
originating from stochastic fluctuations in the shower 
evolution, sampling and signal generation, is found to 
be a = (21.9 ± 1.4)%, in excellent agreement with simu¬ 
lation. The constant term 5 is a measure of calibration 
uncertainties, non-uniformities and instabilities, and is 
found to be b = (1.0 ± 1.0)%. Such effects are not in¬ 
cluded in the simulation. The result thus demonstrates 
that they are indeed very small and can be neglected for 
studies with hadrons. 



FIG. 33 Energy resolution of the AHCAL for positrons in 
data and simulation. The black triangles show data measured 
with an earlier prototype with the same sampling structure. 
Error bars represent the qua dratic sum of statisti cal and sys¬ 
tematic uncertainties. Erom (Adloff et al. 2011b). 


For the determination of the hadronic response, clean 
pion samples selected from data recorded at the same 
CERN SPS beam line (H6) have been analysed in the 
range from 10 to 80 GeV ( [Adloff et al. 2012c). The pri¬ 
mary inelastic interaction was required to be detected in 
the first layers of the AHGAL, by observing a significant 
increase of recorded energy and hit multiplicity over sev¬ 
eral consecutive layers. 

The reconstructed energy is obtained from the sum of 
the energies in the three calorimeters. 


Ehad = ^EC^L -(^HCAL + ^TCMt) , (8) 

TT 


where ^ecal measured energy deposited by the 

particle track in the SiW ECAL, using a conversion factor 
from simulations, validated by muon data. The HGAL 
energy on the electromagnetic scale, F^hcal, is obtained 
in exactly the same way as for electromagnetic showers. 
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as described above, except that the sum over the hit en¬ 
ergies extends over the entire calorimeter volume. The 
constant factor e/yr = 1.19 accounts for the fact that the 
HCAL is non-compensating and corresponds to the ra¬ 
tio of electron and pion response measured in the same 
analysis and averaged over the energy range considered 
here. The first 9 layers of the TCMT have the same 
sampling structure as the HCAL, so its energy L^tcmt is 
reconstructed in the same way. 

The energy distributions for a given beam energy are fit 
with a Gaussian in the interval ±2cr, and mean and width 
of the fit are taken as mean reconstructed energy and res¬ 
olution. The linearity of the response to hadrons, selected 
to start showering early in the HCAL, is within 2% in the 
range 10 to 80 GeV. The resolution is shown in Fig. 
as a function of beam energy. The data are compared 
to simulations using two recent physics lists in Geant4 
version 9.4, which will be described in more detail in Sec¬ 
tion IvD Each set of resolution measurements is fit with 
a function according to Eq. The noise term is fixed to 
0.18 GeV as measured with random trigger events and is 
dominated by the TCMT; the noise in the HCAL alone 
corresponds to 0.06 GeV. Eor data, the stochastic term is 
(57.6±0.4)%, and the constant term is (1.6±0.3)%. The 
simulations predict a somewhat smaller resolution at low 
energies, and in the case of ETEP_BERT a worse resolu¬ 
tion at high energies. In both cases, this leads to smaller 
stochastic and larger constant terms, 49-52% and 4-6%, 
respectively. Overall, the measured resolution falls into 
the range of expectations based on recent shower models. 


, 1 , , , , 1 , , , , 1 , , , , 

1 ' ' ' ' 1 ' ' ' ' 1 ' ' ' ' 1 ' ' ' ' 1 ' ' ' ' I 

^ \ 

Data: n ^ 

L \\ 

Data: : 

--- QGSP_BERT:71- - 


- QGSP_BERT:7i+ : 

r \\ 

FTF_BIC:71- “ 

r tv 

FTF_BIC:71+ J 

- 

A 



r CALICE 


, 1 , , , , 1 , , , , 1 , , , , 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 


h 10 20 30 40 50 60 70 80 90 


Ebeam[GeV] 


FIG. 34 Uncorrected hadronic energy resolution for data as 
well as simulations, us ing two physics lists . The curves show 
fits using Eq[^ From (Adloff et al. 2012c). 


Adloff et al. 2012c 


The performance of the AHCAL with tungsten ab¬ 
sorber has been evaluated at the GERN PS and SPS 
test beams, respectively, in the energy ranges 1 to 


10 GeV (Adloff et al 2014a) and 10-100 GeV (GALIGE 


2013c). Also here, the event selection ensured a start 


of the shower evolution in the first few detector layers. 


Cerenkov counter information and topological cuts were 
used to obtain pure electron, pion and proton samples. 

The sampling structure of the tungsten absorber has 
been optimised for high energy jets and a given total 
thickness of the detector, to provide good shower con¬ 


tainment and to fit inside the magnetic coil (Linssen 


et al. 2012). It is similar to that of the steel prototype in 


terms of nuclear interaction lengths, but much coarser in 
terms of electromagnetic radiation lengths: one layer cor¬ 
responds to 0.13 A/ and 2.8 Xq. Therefore it is expected 
that the electromagnetic energy resolution is poorer than 
for steel, and that the intrinsic resolution for hadrons is 
somewhat worse, too. Moreover, since electromagnetic 
showers are more compact and sampled in fewer layers, 
the impact of single cell calibration uncertainties is larger, 
which is reflected in larger systematic uncertainties, in 
particular for electrons. Given the smaller Moliere ra¬ 
dius, the transverse granularity needs to be re-optimised. 
Eor electron data in the lower energy range, a stochastic 
resolution term of (29.6 ±0.5)% is measured, in excellent 
agreement with simulations, which predict (29.2 ±0.4)%, 
as is the constant term of (0.0 ± 2.1)%,. The hadron res¬ 
olution is also found to be as expected, with a stochastic 
term of 63%, while the constant term is not well con¬ 
strained at these low energies. 

It is noteworthy that the tungsten scintillator com¬ 
bination is nearly compensating and gives very similar 
response for electrons, pions and protons above 3 GeV. 
This response as a function of available energy is shown 
in Eig.j^for data. Here, the available energy represents 
the energy which can be measured in the calorimeter; 
for pions and electrons it corresponds to the particle en¬ 
ergy, for protons it is given by the kinetic energy. The 
measurements are very well reproduced by simulations, 
including the deviation of the electron data from the pion 
extrapolation, which are much smaller than in the case of 
a steel absorber with ^ = 1.19. Preliminary results from 
the higher energy range confirm the same behaviour up 
to energies of 100 GeV, and an excellent linearity, see 
Eig. 56 of (|GALIGEl[2013c). 


E. RPC steel or tungsten DHCAL 

The test beam activities of the DHGAL started in 2010 
at Eermilab with steel absorber and were completed by 
2012 at GERN with a tungsten absorber structure. At 
the start of the test beam campaigns, the tail catcher 
was equipped with the same scintillator strips which had 
already been used in tests with the AHGAL. As the tests 
progressed, they were gradually replaced with RPG lay¬ 
ers, of the same design as in the main stack. Part of the 
data was taken with the GALIGE silicon-tungsten EGAL 
placed in front of the DHGAL main stack. 

A dedicated run was performed without absorber 
plates. In this case, the 2 mm steel and 2 mm copper 
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FIG. 35 Dependence of the mean visible energy on the avail¬ 
able energy for e^, tt^ and protons. The line indicates a linear 
ht to the TT^ data, and the bottom part shows the residuals 
relative to thi s ht. The error band s represent the total uncer¬ 
tainty. From (Adloff et al. 2014a). 


cover plates of the detector cassettes together with the 
glass and readout boards of the RPCs served as the only 
absorber material. Thus, each layer corresponded to a 
thickness of only 0.4 radiation lengths or 0.04 interaction 
lengths. The minimal amount of absorber material pro¬ 
vided the most detailed event pictures recorded with the 
DHCAL, see Fig. 



FIG. 36 Event displays of a 120 GeV proton interacting in 
the DHGAL. 


Muon events were collected with the 32 GeV secondary 


beam and a 3 m long beam blocker placed into the beam. 
Muons provided an excellent tool to monitor the perfor¬ 
mance of the detector elements, i.e. the MIP detection 
efficiency and the average pad multiplicity of the RPCs. 
To measure these, either tracks in the entire DHCAL 
or track segments spanning only five layers were recon¬ 
structed (CALICE 2013a). To avoid a bias of the mea¬ 
surements, in either case, the layer for which the perfor¬ 
mance parameters were assessed, was not utilised in the 
track reconstruction. Fig. shows the efficiency, aver¬ 
age pad multiplicity, and the product of the two, and 
the calibration factors (after normalisation to the aver¬ 
age value over the entire detector) as function of layer 
number. This very uniform performance was obtained 
in the run without absorber plates, where the cooling of 
the cassettes was made easier due to the large gap be¬ 
tween detector elements. When placed in the absorber 
structure the uniformity of the response was not quite as 
good. 
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FIG. 37 The performance parameters of the DHGAL as func¬ 
tion of layer number (top), hit multiplicity di stribution (bot¬ 


tom). From (GALIGE 


2013a 


Repond 2014). 


For purposes of tuning the simulation of the RPC re¬ 
sponse it is useful to measure the average muon response 
per layer. Fig. [37| shows the response per layer averaged 
over all layers in the DHCAL. The data are compared to 
Geant4-based simulations using a two-exponential model 
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to describe the induced electrical charge in the pad plane. 
A satisfactory agreement between data and simulation is 
observed. 

The muon response is utilised to equalise the response 
of the individual RPCs in the DHCAL ( CALICE||2Q13d ). 
Different schemes are being investigated: a) full cali¬ 
bration (application of calibration factors obtained from 
the efficiency and average pad multiplicity), b) density 
weighted calibration (application of the same calibration 
factors as in the full calibration, but to each hit individu¬ 
ally and weighted by a function dependent on the density 
of hits surrounding a given hit), and c) a hybrid of the 
two approaches. In all three cases, the calibration proce¬ 
dure results in a reduced spread of the mean response in 
runs of the same beam energy and in distinct improve¬ 
ments to the linearity of the response as function of beam 
energy. 

Using the density-weighted calibration Fig. shows 
the average number of hits as function of the pion beam 
energy, as measured with the steel absorber plates in the 
Fermilab test beam (CALICE 2013d). The response is 


fitted with a power law, N = a • E'^. The exponent 
of 0.974 indicates a slight saturation, compared to an 
exponent of unity corresponding to a perfectly linear re¬ 
sponse. The resolution is also shown in Fig. As ex¬ 
pected, due to the finite size of the readout pads and the 
ensuing saturation at higher energies, at energies above 
about 35 GeV the resolution remains constant and fails 
to improve further with higher energies. Below 30 GeV 
a fit to C0 a/\fE results in a stochastic term of 64%, in 
good agreement with expectations based on simulation. 

The responses to positrons and pions of the same mo¬ 
mentum are different in the DHCAL, see Fig 39 At 


momenta below about 4 GeV/c the positrons provide a 
higher response (larger number of hits), which is a typical 
feature of non-compensating calorimeters. On the other 
hand, at momenta above 8 GeV the response to positrons 
is suppressed due to the finite size of the readout pads, 
the high density of electromagnetic showers, and the en¬ 
suing saturation of the response. At these momenta 
the number of hits for pions exceeds the correspond¬ 
ing number for positrons, a phenomenon called over¬ 
compensation. In the intermediate momentum range the 
response of the DHGAL is approximately compensating. 

Even though the density-weighted calibration treats 
positrons and pions differently and takes into account 
the local density of hits when determining the calibra¬ 
tion constants, the procedure is not to be confused with 
software compensation, which attempts to correct for dif¬ 
ferences in the response to electromagnetic or hadronic 
sub-showers. The calibration procedure only corrects for 
differences in the overall performance of individual cham¬ 
bers, such that after the correction all chambers show 
the same average response. Thus the calibration fac¬ 
tor for a specific hit in an individual RPG depends on 
a) the difference in performance of the RPG to the av- 
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FIG. 38 Response and resolution for pio ns as measured at 
Fermilab with steel absorber plates. From (|Bilki[ |2013| |CAL- 
ICEl|2013d| ). 


erage performance of the stack b) the local density of 
hits surrounding the specific hit and c) the particle type. 
Therefore, unlike for software compensation, if a given 
chamber’s performance is exactly average, no corrections 
are applied. Obviously, if the procedure were to be ap¬ 
plied to digitised Monte Carlo data, also no corrections 
would be applied. 

Due to the higher elevation of the CERN site, the 
RPCs showed a higher gain at CERN than at Eermilab 
(see also (Bilki et al. 2Q1Q| ) for details on the environmen¬ 
tal dependence of the RPC response). To obtain a similar 
performance, the default high voltage was decreased from 
6.3 to 6.1 kV ( C ALICE} |2Q12a[ ). The increased thick¬ 
ness in radiation lengths of the tungsten absorber plates, 
compared to the steel plates used at Eermilab, resulted 
in a highly suppressed electromagnetic response. Eig. [40] 
shows the response as function of energy as measured at 
the CERN PS. The electron response is seen to be sig¬ 
nificantly smaller than the hadronic response. In other 
words, the W-DHCAL is seen to be overcompensating 
in the entire energy range. Eurt her more, the hadronic 
response is approximately 30% smaller than the corre¬ 
sponding response measured at Eermilab with steel ab- 










































24 


DHCAL Response with Fe Absorber 



FIG. 39 The ratio of the electromagnetic to hadronic re¬ 
sponse in the simulated Fe-DHCAL as function of particle 
momentum. 


sorber plates. Over the entire energy range the hadronic 
response is seen to saturate, with an exponent of m = 


0.90. Fig. 40 shows the resolution for electrons, muons 
and pions as function of beam momentum. Due to the 
smaller hit count, the pion resolution is somewhat de¬ 
graded compared to the case with steel absorber plates. 
However, in the context of a PFA, it should be recalled 
that only the neutral hadron component of a jet, on the 
average 10%, is measured directly in the calorimeter. 

In general, the DHCAL performed very well in the var¬ 
ious test beams. Nevertheless, there were a small number 
of design and operational issues. 

a) Lower efficiency at the edges of the chamber. 

b) Loss of efficiency over time for some chambers, pos¬ 
sibly due to the high-voltage contact. 

c) Environmental dependence of the response, mainly 
affecting the pad multiplicity (Bilki et al. 2010). 


The Fermilab and CERN test beam set-ups have been 
simulated with a Monte Carlo program based on the 
Geant4 package and a standalone program RPC SIM for 
the simulation of the response of RPCs. The spatial co¬ 
ordinates of any energy deposition in the gas gap of an 
RPC was recorded for further analysis. In the following 
these energy deposition locations are named points. 

Eor each generated pointy the RPC SIM program gen¬ 
erates a signal charge Q, distributes this charge over the 
pads, sums up all charges on a given pad, and applies a 
threshold T to identify the pads with hits. 

The signal charges are generated according to the mea¬ 


sured (Drake et al. 2007) spectrum of avalanche charges. 


as was obtained with cosmic rays. The induced charge in 
the plane of the readout pads is assumed to decrease as 
a function of lateral distance R from a given point. This 
decrease is parameterized as the sum of two exponentials. 



Beam energy [GeVl 


FIG. 40 Response (number of hits) and resolution for elec¬ 
trons, muons and pions as measured in the PS with tungsten 


absorber plates. From ( |GALIGE||2012a| Repond 2012). 


The emulation of the RPC response depends on six 
parameters, of which five were tuned to reproduce the 
distribution of hits per layer as measured with muon 
tracks. The sixth parameter, related to the suppression of 
avalanches close-by to other avalanches, was tuned such 
as to reproduce the response to positrons. 


With all parameters of the RPCSIM program tuned, 
the program is now ready to predict the response to 
hadron beams, without having lost its predictive power. 
The comparison of the measured and simulated response 
to pions is ongoing and will be subject of future publi¬ 
cations. However, in a first iteration, the RPCSIM pa¬ 
rameters were tuned to the data collected with a small- 
scale DHCAL prototype, the so-called Vertical Slice Test 


(VST) ( [Bilki et al 2008 2009a|b|c 2010). Due to the 


limited depth of the VST only the forward part of 
hadronic showers could be measured, while most of the 
energy leaked out the back of the stack. Nevertheless, 
the distribution of the number of hits could be well re¬ 
produced with Geant4 and the tuned RPCSIM program, 
giving confidence in the overall approach to simulate the 
DHCAL response. 
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F. RPC steel SDHCAL 


The SDHCAL, featuring three readout thresholds en¬ 
coded in 2 bits per cell instead of only one, has been 
subject to a large scale test beam campaign in 2012. 
The current main results of the performance are sum¬ 
marised in (CALICE 2012b). No notable problems are 
reported from this running period. The minor shortcom¬ 
ings concern coherent noise and a small number of dead 
ASICs. At first sight, the SDHCAL is quite similar to the 
DHCAL. However, the detector readout is significantly 
different. The HARDROC ASIC used in the SDHCAL 


is already adapted to the expected mode of operation at 
the future ILC. Operated in self-triggering mode it stores 
up to 128 events in an internal buffer. The entire detec¬ 
tor has to be read out if the buffer of any of the ASICs 
is full. This requires an excellent control of the noise 
level of the detector. Fig. shows in its top part the 
hit spectrum as a function of elapsed time. The num¬ 
ber of hits attributed to physics events is significantly 
different from the number for noise events. The subdi¬ 
vision of the X axis into 200 ns bins is motivated by the 
clock frequency of the HARDROC ASIC. The bottom 
part shows the number of noise hits within 200 ns. This 
spectrum has its maximum number of entries at zero hits 
and the resulting average is 0.35 hits/200 ns. 

The hit efficiency and the hit multiplicity are im¬ 
portant benchmarks for the performance of a gaseous 
calorimeter. As shown in Fig. for the CALICE SD¬ 
HCAL an efficiency of about 95% and a hit multiplic¬ 
ity of about 1.7 hits have been measured with muons 
crossing the detector. These numbers are independent 
of the layer. It is interesting to note that these numbers 
have been confirmed with track segments within hadronic 
showers. They depend on environmental conditions in a 
similar way as for the DHCAL. 

An important cross check of the performance of the 
CALICE SDHCAL are the results obtained in pure bi¬ 
nary mode. This will allow for judging the benefit of 
having additional thresholds and for comparisons with 
the CALICE DHCAL. The raw response of the calorime¬ 
ter is given in the top part of Fig. Saturation effects 
set in for an energy of the primary pion of about 30 GeV. 
To account for this non-linearity the reconstructed en¬ 
ergy is calculated as a function of the number of hits. 
This function takes the form E = (C + D • Nhu) * 
with C = 0.0543 and D = 9 • 10“^. By construction the 
reconstructed energy is linear as a function of the energy 
of the primary particle. The resulting energy resolution 
is shown in the bottom part of Fig. The shape of the 
resolution curve agrees with the qualitative expectation. 
It flattens out at energies above 30 GeV and approaches 
a value of about 15%. Both observations are broadly 
compatible with the results obtained for the DHCAL ac¬ 
cording to Fig. Understanding the different contri¬ 
butions to the resolution in terms of simulations, and of 
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FIG. 41 Top: Number of hits in physics events (green) and 
number of noise hits in the CALICE SDHCAL as function of 


time in units of clock cycles of the HARDROC ASIC. Bottom: 


Total number of noise hits within 200 ns. From (Boudry 2012 


CALICE 2012b). 


the differences between the prototypes, is still ongoing. 


By virtue of the HARDROC ASIC the SDHCAL im¬ 
plements three readout thresholds in order to moderate 
saturation effects in dense shower regions. A natural 
question is whether these thresholds allow for a true (even 
still rough) measurement of the different energy deposi¬ 
tions. This question is even more justified since the RPC 
chambers are operated in avalanche mode, and the mea¬ 
sured charge is only weakly correlated with the primary 
deposited energy in a given cell. Fig. [44| shows an 80 GeV 
pion that interacts in the SDHCAL. The incoming par¬ 
ticle acts MIP like in early stages of the shower before 
the interaction. The core of the interaction is populated 
with red coloured entries that represent high energy de¬ 
position. These regions of high energy depositions are 
surrounded by hits associated with smaller energy de¬ 
positions. Thus, the expected rough features of highly 
energetic hadronic showers are reflected by the multi¬ 
threshold read-out. Quantitative results using the three 


thresholds will be presented in Sec. IV.B 
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FIG. 42 Top: Average hit multiplicity in the CALICE SD- 
HCAL chambers as a function of the layer. Bottom: Average 
efficiency of the CALICE SDHCAL chambers as a function of 
the layer. Erom (Boudry 2012 CALICE 2012b). 


G. Tests with alternative technologies 


Additional alternative technologies have been sug¬ 
gested for implementing particle flow calorimetry. They 
have not been developed to the extent of those already 
described, but are included here for completeness as they 
may be used in actual future detector systems. 

For the ECAL, a highly granular solution has been 
proposed that uses Monolithic Active Pixel Sensor tech¬ 
nology. Three first-generation sensors for digital electro¬ 
magnetic calorimetry consist of 168 x 168 pixels with the 
required size of 50 microns by 50 microns. These sensors 


were tested (Stanitzki 2011) using sources and lasers, 


and a beam at CERN. The addition of a deep p-well fea¬ 
ture and a high resistivity epitaxial layer made the sensor 
close to 100% efficient for MIPs. 

For the GEM-based DHCAL approach, beam tests 
have been made with several 30 x 30 cm^ double-GEM 
foil chambers, with 1 cm^ anode pads at Fermilab. Both 
the KPiX and Argonne DCAL readout systems were 


tested. The important results (White et al 2012) of 


these tests were that the chambers operated stably in 
both beam types, and a hit multiplicity vs. efficiency 
curve that shows that this technology is a very good can¬ 
didate for use in a DHCAL - see Fig. 1^ 
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FIG. 43 Top: Raw response of the GALIGE SDHGAL as a 
function of the energy of primary pions. Bottom: Energy 
resolution obtained after co rrection for the non linearity a s 
explained in the text. Erom (Boudry 2012 GALIGE 2012b). 


For the thick-GEM option, single and double resistive 
well chambers, 10 x 10 cm^ were exposed to a muon beam 


at CERN. The resulting (Arazi et al. 2013) efficiency vs. 
hit multiplicity curves are shown in Fig.|46 showing that 
low multiplicity at high efficiency was obtained - again 
indicating the suitability of this approach for use in a 
DHCAL. 

Tests have been made with 1 m^ micromegas chambers 
in beam at CERN, with a goal of using this technology 
for a semi-digital HCAL.The limited spatial dimensions 
of the signal from the avalanche in the micromegas results 
in a low hit multiplicity - about 1.15, indicating the suit¬ 
ability of this technology also for DHCAL applications. 


Results on the hit multiplicity vs. efficiency (Chefdeville 


2012) are shown in Fig. 47 


In later tests, four 1 m^ micromegas planes were in¬ 
stalled in a RPC SDHCAL stack and data recorded. The 
distribution of numbers of hits in the planes vs. the po¬ 
sition of the planes in the stack was found to be in good 
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FIG. 44 Interaction of a pion with an energy of 80 GeV in the 
GALIGE SDHGAL. The top part shows the xz projection of 
the hit positions and the lower part the yz projection, respec¬ 
tively. Red coloured hits indicate hits in which a charge above 
the highest of three thresholds has been measured. Blue in¬ 
dicates hits with a charge deposition between the second and 
the third threshol ds and green those between th e first two 
thresholds. From (Boudry 2012 GALIGE 2012b). 


EIG. 46 Ghamber effic iency vs. hit mult iplicity for thick- 
GEM prototypes. Erom (Arazi et al. 2013). 



EIG. 47 The number of anode pads above threshold versus 
micromegas chamber efficiency. 



EIG. 45 The number of anode pads above thresho ld versus 
GEM chamber efficiency. Erom ( [White et al. 2012). 


agreement with expectations. 


and conceive the beam tests as combined set-ups from the 
beginning. The combination of energy measurements is 
discussed here, while the next section is devoted to the 
combined particle flow performance. 

The imaging power of the calorimeters offers additional 
means to improve the particle energy measurements. The 
fine segmentation provides ideal conditions for the appli¬ 
cation of software compensation methods, based on local 
or global energy or hit density. The energy reconstruction 
of a semi-digital calorimeter makes also use of weighted 
combinations of the hit multiplicities for each of the mul¬ 
tiple thresholds. Finally, leakage can be estimated on the 
basis of topological observables such as the shower start 
point or activity in the rear part of the HCAL. 


IV. WEIGHTING, SOFTWARE COMPENSATION AND 
COMBINED PERFORMANCE 

The detector performance for single hadrons and for 
jets depends on the combined resolution - for energy and 
topology - of ECAL and HCAL together, possibly com¬ 
plemented by a tail catcher / muon tracker (TCMT). It 
is a particular strength of the CALICE validation ap¬ 
proach to take an integral view of the calorimeter system 


A. Software compensation in the AHCAL 

For most calorimeters, the response, i.e. the ratio of the 
signal generated in the active part to the total deposited 
energy, is different for electromagnetic and for hadronic 
showers, as was discussed in Section [I| Therefore event- 
to-event-fluctuations in the electromagnetic content of 
the shower contribute to the hadronic energy resolution, 
unless the difference is compensated. This is done either 
by design, by using active and passive materials in an 
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optimised sampling ratio, such that the electromagnetic 
response is suppressed and the hadronic is enhanced, or 
by so-called software compensation. The latter method 
exploits the fact that for the heavy absorber materials of 
hadron calorimeters, the radiation length is much smaller 
than the nuclear interaction length, such that the elec¬ 
tromagnetic parts of the shower are much more com¬ 
pact and have a higher density. Therefore compensa¬ 
tion can be achieved at the reconstruction stage if en¬ 
ergy deposits in regions of high energy density are given 
a lower weight than those in less dense regions. This 
requires three-dimensional segmentation of the read-out. 
One of the first applications was in the CDHS exper¬ 
iment (Abramowicz et al 1981), it was further devel¬ 


oped and applied for the HI (Andrieu et al. 1993 


Issever 


et al. 2005) and the ATLAS calorimeter (Cojocaru et al. 


2004). 


Due to the fine 3D segmentation, particle flow 
calorimeters offer excellent conditions for the successful 
applications of such methods. A local (LC) and a global 
(GC) compensation method have been studied (Adloff 


et al.\ 2012c) using data with steel absorber and the same 


event selection as described in Section IIILDI The first 
method uses the local energy density in the shower to 
determine weights for each hit, while the second applies 
a global correction factor to the whole reconstructed en¬ 
ergy, which is based on the hit energy distribution in the 
shower. The weights and parameters for these methods 
have been obtained from data and then been used for 
independent data samples as well as for simulations. 

For the LC method, the weighted energy is recon¬ 
structed according to 


ElC = ^ECAL + “ ' (^HCAL,i ' ^i) + ^TCMT J (9) 

- see also Equation Here in addition uoi is introduced, 
the energy density dependent weight applied to the cell 
energy Ehcal,^- The density is here simply the cell en¬ 
ergy divided by the cell size. Fig. shows the energy 
density distribution for 40 GeV pion showers. The distri¬ 
bution is binned as indicated, and in each bin the weights 
are determined such that the energy resolution is opti- 
mised, by minimising = Ei(-®LC,i - Aeam)^- The re¬ 
sults of this procedure are also shown in Fig.|^for data. 
The weights depend not only on the hit energy, but also 
on the total particle energy. For interpolation between 
the bins, this energy dependence is parameterised. Since 
the true energy is not known a priori for data, the un¬ 
weighted energy is used instead for the determination of 
weights. It was found that no further iteration is neces¬ 
sary. 

In the global method, a correction factor Cgiobai is cal¬ 
culated for each shower from the ratio of the number of 
hits below a given threshold, and the number of hits be¬ 
low the mean hit energy for this shower. This factor is 
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FIG. 48 Hit energy distribution in the AHCAL for 40 GeV 
pion shower test beam data (top), optimal weights derived 
from data, as a f unction of energy den sity and particle energy 
(bottom). From ( Adloff et a/.[|201^ . 


smaller for events with a larger electromagnetic compo¬ 
nent and thus fewer low energy hits. In practice, the 
correction proceeds in two steps, where first the global 
correction is applied to improve the estimate of the en- 
ergy, F^shower = Cgiobai • (£^hcal + ^^tcmt), and second 
an energy-dependent correction function Pgiobai restores 
linearity: 


Eqc — -^ECAL E Pshower ’ Pglobal(Pshower) • (^0) 

The results of applying these procedures, after train¬ 
ing them on data, to independent data sets, are shown in 
Fig. 1^ For both compensation methods, the linearity 
of the AHCAL is preserved or improved, and better than 
±1.5%. The resolution is improved by 12 — 15%, depend¬ 
ing on energy. Numerical results of fits using Eq. are 
shown in Table iHIl A stochastic term around 45% and 



a [%] 

b[%] 

c [GeV] 

uncorrected (Eq. 

57.6T0.4 

1.6T0.3 

0.18 

local compensation (Eq. dI 

44.3T0.3 

1.8T0.2 

0.18 

global compensation (EqHTol) 

45.8T0.3 

1.6T0.2 

0.18 


TABLE III Stochastic, constant and noise term contribu¬ 
tions to the resolution of the CALICE AHCAL determined 
with a fit of Equation]^ to data. From ( Adloff et al.\ 2012c). 


(Adloff et al.\ 2012c 
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FIG. 49 Linearity and resolution for unweighted AH- 
CAL data, and for the two software compensation methods. 
From (Adloff et al. 2012c). 


a constant term below 2 % are achieved, which is also 
reasonably well reproduced in simulations. The results 
show that a particle flow calorimeter with high granu¬ 
larity can be realised which still has a very good purely 
calorimetric performance. Software compensation is still 
to be integrated into particle flow algorithm, where some 
improvement for the jet energy performance can be ex¬ 
pected, in particular at the lower end of the linear col¬ 
lider jet energy range, where the intrinsic HCAL resolu¬ 
tion dominates. In addition, it can help to sharpen the 
calorimeter cluster association to charged particle tracks; 
here one can use the measured momentum for the proper 
choice of weights. 


B. Semi-digital reconstruction 


As was shown in Section the spatial distribution 
of hits above the three thresholds in the RPC-SDHCAL 
meets the intuitive expectation. The reconstruction now 
seeks to exploit the additional information in order to 


improve linearity and resolution of the energy response. 


In (CALICE 2012b 2013b) the reconstructed energy 
Free is expressed as a function of the number of hits 
Ni, i = 1..3 above the three thresholds. In this approach 
Free is given by 


Free = aNi + I3N2 + 7A^3 


( 11 ) 


The coefficients are a function of the total num¬ 

ber of hits Nhit = Ni ^ N 2 F N^. The functional depen¬ 
dence is derived from the minimisation of a test vari¬ 
able that uses a subset of the data at each of the energy 


points of 10, 20, 30,40, 50, 60 GeV. According to (CAL¬ 


ICE 2012b) the best results are obtained by a 2nd order 


polynomial. The coefficients as a function are shown 
in Eig.[^ The parabolic dependency is most clearly vis¬ 
ible for the coefficient 7 associated with the highest read¬ 
out threshold. Optimising the weight for each threshold 
without strict relation to the cell energies, and letting 
the weights vary with total number of hits - or energy 
- contains some elements of software compensation, e.g. 
as applied in the local method for the AHCAL. In this 
context it is interesting to note that due to the parabolic 
dependency the relative importance of the highest thresh¬ 
old increases with an increasing number of hits that is in 
first order proportional to the deposited energy. 



FIG. 50 Evolution of the coefficient a (green), {3 (blue) and 7 
(red) as a function of the total number of hits. From (|GAL 
|ICE1 |2013b| |Steen| |M4| ). 


Using these coefficients the response of the detector 
for the example of pious with energies of 10 GeV and 
80 GeV is demonstrated in Fig.[^ At small energies the 
response is Gaussian like. At higher energies a low en¬ 
ergy tail reflects the loss of information due to saturation 
effects. The distributions are fitted with the Crystal Ball 
function (Gaiser 1982) that can take this information 
loss adequately into account. The mean of this Crystal 
Ball function is close to the nominal beam energy. 

As shown in Fig. the detector response is linear 
within 5% over an energy range between 10 and 80 GeV. 

The energy resolution over this energy is given in 
Fig. 1^ It seems that the semi-digital approach allows 
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FIG. 51 Energy reconstruction with the RPC-SDHCAL for 
pion showers of 10 and 80 GeV (right) using the threshold 
information. T he Grystal Ball function is fitte d to the distri¬ 
butions. From (GALIGE 2013b |Steen| 2014). 
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FIG. 52 (a): Mean reconstructed energy Ereco in the 

RPG-SDHGAL for pion showers and (b): relative deviation 
AE/Ebeam = {Ereco — Eheam) IEbeam of the piou mean rCCOU- 


structed energy from the beam energy Ebeam- From (GAL- 
ICEI |2013b| |Steen| ^0T ^ . 


for an improved control of the saturation effect that is 
expected to set in towards higher energies. Therefore 
semi-digital calorimetry might become a way to soften 
limits of a pure digital approach. 


For the micromegas option, a large scale prototype 
does not yet exist. The study of the benefit of additional 
thresholds beyond the pure digital approach is therefore 


so far based on Monte Carlo studies only (Chefdeville 


2013). There it is assumed that a second threshold is 


available for the energy information. In terms of the hits 
above thresholds 0 and 1, the reconstructed en- 


FIG. 53 The energy resolution 


of the RPG-SDHGAL 


^reco 

of the reconstructed pion e nergy Ereco as a function of the 
beam energy Ebeam- From ( GALIG^ 2013b Steen 2014). 


ergy can be written as 


Erec{No, iVi) = C(7Vo + DNi) (12) 


The coefficient C is a simple conversion factor from hits 
to energy derived from the linear part of the spectrum, 
i.e. where E^ec oc Nq. It is thus the coefficient D that 
quantifies the contribution of the number of hits above 
the second threshold. It is determined analytically from 
Eq. by requiring that the reconstructed energy E^ec 
equals the beam energy Ebeam ^ i-e. 

D = l/Ni{C-^Ei,eam-No) (13) 


The relative importance of the hits above the second 
threshold increases with energy. The benefit of an addi¬ 
tional threshold depends considerably on the actual value 


of the second threshold. This is demonstrated in Fig. 54 


The ’unnatural’ increase of the energy resolution towards 
high energies is inverted for a high second threshold, here 
set to 15 MIP. 

Digital and semi-digital reconstruction schemes have 
not yet been implemented into the Pandora particle flow 
reconstruction algorithm. As for the case of software 
compensation, the use of total energy or total number 
of hits per shower for the calculation of weights is non¬ 
trivial in the case of dense jet environments. 


C. Combining ECAL, AHCAL and TCMT 

In a collider detector, there is generally an electro¬ 
magnetic section in front of the hadron calorimeter, and 
ECAL and HCAL, and possibly TCMT, measurements 
must be combined in order to reconstruct hadron or jet 
energies. It is their combined performance that counts for 
physics, and it can be worse than that of the HCAL alone. 
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FIG. 54 Energy resolution of a micromegas-SDHCAL for a 
pure digital approach and two settings of the second thresh¬ 
old. The results are taken from a Monte Carlo stu dy of a 
large micromegas-SDHCAL. From (Chefdeville 2013). 


for example, if the ECAL is highly non-compensating. In 
any case, weighting factors are needed to account for the 
different response in each detector section. 

The response is expected to approximately scale with 
the sampling fraction, the ratio of visible to totally de¬ 
posited energy, and for the same type of active detector 
and passive absorber, with active and passive material 
thicknesses. In addition, there are surface effects, due 
to the short range of soft photons, which introduce a 
dependence on sampling frequency^ too. Moreover, the 
variation of the response with particle type results in a 
dependence on shower ‘age’, since the particle composi¬ 
tion in the early stages of shower evolution is different 
than in later stages; therefore optimal weighting factors 
may deviate from the first approximation according to 
sampling. 

In the CALICE prototypes, the response is measured 
in units of MIPs, and is then converted to the electro¬ 
magnetic scale in GeV, which gives the best estimate for 
electrons and photons. For hadrons, there is an addi¬ 
tional correction factor to account for the average c/tt 
ratio, 1.19 in the AHCAL case. To give a numerical ex¬ 
ample, in the AHCAL with steel absorber, the conversion 
factor is i?HCAL = (42 MIPs/GeV)-^ = 0.024 GeV/MIP, 
and for the first section of the silicon ECAL it is i^ECAL = 
(266 MIPs/GeV)“^ = 0.0034 GeV/MIP. In simulations, 
the most probable value of MIP energy deposition in the 
scintillator is found to be 816 keV, so the sampling frac¬ 
tion of the AHCAL is 3.4%. The absorber thicknesses of 
the first, second and third ECAL sections are 1.4, 2.8 and 
4.2 mm, respectively. The TCMT has the same absorber 
thickness as the HCAL in its first section, and 5 times 
more in the second. Therefore, a first approximation for 


the total hadronic energy would be 

^had = (^ECALl + 2 F^eCAL2 + 3F^eCAL3) * {R • e/7r)ECAL 
+ (^HCAL + ^TCMTl + 5 F^tCMT2) ' {R ’ e/7r)HCAL 
= O^l^ECALl + <^2^ECAL2 + <^3^ECAL3 

+ a4E'HCAL + <^5^TCMT1 + «6^TCMT2 (14) 


Note that in the case of the tungsten HCAL absorber the 
individual WHCAL response and e/yr has to be inserted. 

CALICE has collected data with the following com¬ 
binations: silicon ECAL + AHCAL, scintillator ECAL 
-|- AHCAL, silicon ECAL + DHCAL. Results are given 
here for the first (CALICE 2009b Simon 2010), while 
analysis for the others is still on-going. The data set is 
the same as the one studied in Section III, however, in 
the event selection, requirements on shower containment 
in the HCAL have been made. For the reconstruction 
of the energy, the six weighting factors - three for 
the ECAL, one for the HCAL and two for the TCMT 
- have been determined in an optimisation procedure, 
minimising = (E’beam - ELi 9,11 runs in 

the energy range from 10 to 80 GeV. From the result, 
three internal inter-calibration factors were extracted, 1 
: 1.124 : 1.629 for the ECAL and 1 : 4.55 for the TCMT. 
These were fixed and the external inter-calibration fac¬ 
tors determined with a second minimisation. The re¬ 
sult is given in Table IV In particular for the ECAL, 


Detector 

conversion factor [GeV/MIP] 

ECAL 

0.00827 

HCAL 

0.0293 

TCMT 

0.0337 


TABLE IV Conversion factors R • ejir from MIP to the 
hadronic energy scale for each detector in the complete CAL¬ 
ICE test beam set-up, as determined in a minimisation, 
see text. 

both the internal and external inter-calibration factors 
deviate significantly from the expectation based on the 
simple scaling considerations outlined above. This indi¬ 
cates that correlations in the shower and the shower age 
effects mentioned earlier do play a role. In principle one 
could use weights depending on the reconstructed shower 
start point, but this has not yet been tried. 

In addition to the method using single weights, the lo¬ 
cal software (LC) compensation method described above 
was extended to incorporate the ECAL and the TCMT. 
Weights depending on energy and energy density have 
been optimised using the same x^ minimisation, but 
keeping the internal inter-calibration constants from the 
single weight method. Both methods result in a very 
good linearity, with less than 5% excursion over the full 
range. The resolution is shown in Fig. for the two 
cases. For the combined set-up, it is almost as good as 
for the AHCAL alone, and the improvement using the 
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software compensation method is also very similar. The 
constant terms are slightly larger; however, for this pre¬ 
liminary study no temperature correction had been ap¬ 
plied yet. 



The degraded resolution can be recovered by adding 
the TCMT energies; however, this would not be real¬ 
istic, since in a real detector the coil between HCAL 
and TCMT represents un-instrumented material, typi¬ 
cally corresponding to 1.5 to 2A/. In order to obtain a 
realistic estimate of resolution improvement due to the 
TCMT, the effect of the coil material was emulated by 
excluding a corresponding number of TCMT layers from 
the energy sum. The resolution is measured as RMS, 
to maintain the sensitivity to the leakage-induced tails. 
Results for 20 GeV pions are shown in Fig. for cases 
with and without sampling behind the emulated coil, as 
a function of total ECAL plus HCAL absorber thick¬ 
ness. As expected, the effect is reduced as the calorime¬ 
ter thickness increases, but for a typical depth of about 
5.5A/ the improvement is still 6 to 8%. With rising en¬ 
ergy, the improvement gets more significant; at 80 GeV, 
it corresponds to 16%. 


FIG. 55 Hadron energy resolution for the combined ECAL 
-h AHCAL -h TCMT set-up, using single weights f or each de¬ 
tector, or energy den sity dependent weights.(From ( CALIC^ 


2009b Simon 2010)) 


D. Impact of the tail catcher 


Particle flow methods require the ECAL and the 
HCAL to be located inside the solenoidal coil, such that 
the total depth, and thus containment, of the calorimeter 
system is constrained by cost considerations and techni¬ 
cal limitations of the coil size. Even small shower leakage 
contributes significantly to the energy resolution, in par¬ 
ticular at higher energies, since its event-to-event fluctu¬ 
ations are larger than its mean value. Two means are 
at hand to correct each particle measurement: the en¬ 
ergy seen in the TCMT, and, thanks to high granularity, 
shower topology based leakage estimates. 

The impact of the TCMT on the hadronic energy reso¬ 


lution has been investigated in a dedicated study (Adloff 


et ai 2012b). Eirst, the amount of leakage has been di¬ 


rectly measured, by comparing the energies reconstructed 
in the the ECAL and HCAL alone with that measured 
in the complete set-up, including the TCMT. The data 
were taken at the CERN SPS in 2006 where only 30 ab¬ 
sorber layers of the AHCAL had been installed, so the 
first, finely segmented section of the TCMT was treated 
as part of the HCAL. This emulates a structure with a 
depth of 5.9 nuclear interaction lengths A/ for ECAL and 
HCAL, which is typical for ILC detector designs. In the 
energy range from 10 to 80 GeV, the fractional leakage 
increases from 3% to 8%, with RMS values from 8% to 
12%. The distribution of leakage energy exhibits long 
tails, such that the fraction of events with more than 
10% leakage ranges from 3 to 6%. 
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FIG. 56 Hadron energy resolution (RMS) with and without 
the TCMT after an emulated coil, as a function of ECAL plus 
HCAL absorber thickness. From ( [Adloff et al. 2012b). 


The impact of a tail catcher on the jet energy resolution 
depends on the abundance of high-energy particles with 
sizeable leakage, and therefore also increases with jet en¬ 
ergy. The TCMT has been included in simulation and 
reconstruction for the optimisation of the HCAL thick¬ 
ness of the ILD detector using the Pandora algorithm. 
In comparison to classical calorimetry, the particle flow 
approach drastically reduces the sensitivity to leakage, 
since charged hadrons are measured with the tracker and 
only neutrals are affected. Nevertheless, for jet energies 
above 100 GeV and calorimeter thickness below 6A/ (for 
the HCAL alone) the benefits of the TCMT for the jet 
resolution are clearly visible (Thomson 2009). 


E. Leakage estimation using the shower topology 

The detailed reconstruction of each shower shape pro¬ 
vides additional means to derive event-by-event correc¬ 
tions for leakage. However, one has to keep the large flue- 
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tuations in the evolution of hadronic showers in mind; 
a simple extrapolation of the visible part is not suffi¬ 
cient. Showers which apparently finished their develop¬ 
ment may ”re-appear” in a deeper detector section, since 
a high momentum neutral hadron carried leading energy 
away from a hard interaction to deposit it there, with¬ 
out leaving a trace in several interaction lengths of tra¬ 
versed material in-between. Still, the fraction of energy 
deposited in the rearmost part of the HCAL is statis¬ 
tically correlated with the amount of leakage, and the 
variation of the depth of the first interaction represents 
the largest single source of leakage fluctuations. 

These two observables have been used in a correction 
procedure developed in (CALICE 2011) and applied to 
data taken at energies ranging from 8 to 100 GeV with 
the ECAL, AHCAL and TCMT combined set-up at the 
CERN SPS in 2007. The starting point has been re¬ 
constructed by detecting the first layers with energy and 
multiplicity above thresholds, and the end fraction is the 
energy in the rearmost four AHCAL layers, normalised to 
the sum of ECAL and AHCAL deposits. Their correla¬ 
tion with the energy reconstructed in ECAL and AHCAL 
alone is shown in Eig.l^ In the case of 80 GeV pious, the 
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FIG. 57 Correlation between the energy, reconstructed in 
ECAL plus AHCAL, normalised to the beam energy, and the 
shower starting poin t (top), and end fraction (bo ttom), for 
80 GeV pions. From (CALICE 2011 Zutshi 2012| . 


correlation is found to be 60 to 70%, respectively. The 
figures clearly demonstrate the large fluctuations: events 
with an early start or a small end fraction still do ex¬ 
hibit large leakage in some cases; therefore a statistical 


combination of several observables is promising. 

The correlations are different for different energies.To 
obtain an unbiased procedure, an energy-dependent cor¬ 
rection is derived as follows: the data from all energies 
are combined and binned in two dimensions in the two 
observables starting point and end fraction. In each bin 
a correction function depending on the reconstructed en¬ 
ergy is extracted by parameterising the correlation with 
the beam energy with a polynomial fit. The function is 
flat and has values near one for early starts and small 
leakage, but exhibits a stronger energy dependence for 
late starts and large end fractions, where correction fac¬ 
tors increase from 1 to 4 with energy in the studied range. 
Fig. [58| shows the distribution of reconstructed energy in 
ECAL plus AHCAL before and after applying the cor¬ 
rection to an independent sample of 80 GeV test beam 
data. At this energy, the improvement in relative RMS 




energy ECAL+AHCAL corr [GeV] 


FIG. 58 Reconstructed energy in ECAL plus AHCAL, be¬ 
fore (top) and after (bottom) applying the correction based on 


shower start and end fraction, for 80 GeV pions. From (CAL- 
ICEl[MT||ziitshI|[M^ . 


resolution is 25%. At lower energies the degradation and 
thus the improvement is less pronounced. Similar results 
were obtained for Monte Carlo simulations. 

A possible future development would be to implement 
such corrections into the particle flow reconstruction al¬ 
gorithm, for example for a re-assessment of the required 
absorber depth. One should expect a somewhat reduced 
performance of the correction, since in a dense jet en¬ 
vironment both the determination of the end fraction 
and the reconstruction of the shower start for neutral 
hadrons will be disturbed. Yet, some net improvements 
should remain, in particular for energetic jets, which 
would be complementary to the information provided by 
the TCMT. It is unlikely that the TCMT can be substi¬ 
tuted completely by topological leakage estimation, be¬ 
cause the latter become ineffective for very late showers 
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or complete ” sail-through” of primary hadrons as MIPs. 
These rare cases may hardly affect the average jet per¬ 
formance, but they may become important for missing 
energy signatures of rare processes. 

These considerations of topological leakage estimation, 
and the relation of the fine hadronic calorimetry and the 
tail-catcher, are input to the process of ILC detector de¬ 
sign optimisation, which is ongoing. 

V. CALIBRATION ISSUES 

The highly granular particle flow calorimeters have 
channel counts, which exceed those of existing detectors, 
at the LHC for example, by several orders of magnitude. 
The question thus arises how these large numbers can be 
handled with respect to calibration and monitoring, what 
precision is actually required, and how it can be obtained 
and maintained. This section discusses general calibra¬ 
tion issues for particle flow calorimeters. Aspects specific 
to each technology are summarised in Section [in| on per¬ 
formance. The discussion is limited to the single particle 
energy calibration; corrections at jet level are part of the 
particle flow reconstruction and need to take tracking in¬ 
formation on an event-by-event basis into account. 

A. Calibration scheme 

Calibration as a general term is used for several as¬ 
pects of the calorimeter reconstruction. For the channel- 
to-channel normalisation we use the term equalisation, to 
be distinguished from the corrections of time-dependent 
effects, induced for example by temperature or pressure 
variations. Tracing such variations is called monitoring. 
Establishing an absolute scale in units of GeV is again 
a separate task, and different scales, electromagnetic, 
hadronic or weighted scales need to be distinguished. If 
applied at the particle level, they may depend on the 
clustering definition. Other corrections, such as for dead 
materials, may be applied at the particle level, too. 

The calibration of the electromagnetic and hadronic 
response of the calorimeter proceeds in the following gen¬ 
eral steps: 

1. Test bench characterization of sensor parameters at 
cell level 

2. Inter-calibration of the electronic response of all in¬ 
dividual cells using muon test beams, and conver¬ 
sion to the MIP scale 

3. Verification of the electromagnetic scale and linear¬ 
ity using electron beams impinging directly on the 
detector modules 

4. Determination of the hadronic response using 
hadron test beams 


5. Determination of combined ECAL and HCAL 
hadronic response, including weighting procedures 

6. Verification of dead material corrections at inter¬ 
module connections using hadron test beams 

7. In-situ validation and monitoring using kinematic 
constraints, tracker information and track segments 
in hadronic showers 

To a large extent these steps have been carried out for 
the test beam data, providing a basis for extrapolation 
to a full collider detector. 

B. Channel equalisation and energy scales 

Eor silicon and scintillator, the muon response defines 
the MIP scale. Eor gaseous detectors, the response is 
proportional to the product of efficiency and pad multi¬ 
plicity which are also determined with muon beams. 

The inter-calibration with muon beams must be done 
for all cells and all detector layers. Thanks to the modu¬ 
lar design, this can be done with the assembled modules, 
but also with the bare active layers before insertion into 
the absorber. Eor example, in the CERN test beam 12 
hours were needed for the AHCAL to acquire sufficient 
statistics on a stack with a square meter front face and 38 
layers. This would translate into about two months for 
an entire ILC detector, or less, if more layers are aligned 
after each other in the beam. The analysis of the cali¬ 
bration data can be massively parallelised. 

The response to electromagnetic showers on the MIP 
scale can be uniquely predicted by simulations and ver¬ 
ified in test beams with known energy. In practice, the 
electromagnetic scale is useful only for the linear range for 
electrons and hadrons, i.e for the silicon and scintillator 
detectors. Eor these also a hadronic scale can be defined, 
since the deviations from linearity are small (less than a 
few per-cent up to 80 GeV). Eor the gaseous calorime¬ 
ters, the hit multiplicity noticeably deviates from a lin¬ 
ear behaviour for energies above 30 GeV, and additional 
procedures or weighting techniques need to be applied to 
obtain a linear scale. 

The weighted energy scale depends on the applied algo¬ 
rithm. Software compensation methods and semi-digital 
reconstruction have been discussed in Section Ml Ad¬ 
ditional corrections will be necessary to account for un¬ 
instrumented regions or additional material from support 
structures, electronics and service lines, at the EGAL- 
HGAL transition and at inter-module boundaries. This 
must be extracted from simulations which need to be 
benchmarked in test beams with realistic EGAL and 
HGAL prototypes combined. Apart from the inter- 
calibration, which must be done for every individual ac¬ 
tive detector element, it is assumed that all studies ad¬ 
dressing the absolute electromagnetic and weighted scales 
can be done with single representative sample structures. 
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FIG. 59 Comparison between calibration constants obtained 
in tw o different data taking periods in 2006. Results are taken 
from (Repond et al. 2008). 


C. Monitoring and in situ techniques 


The above calibration scheme needs to be comple¬ 
mented by monitoring techniques in order to take time- 
dependent variations into account. The general approach 
is that if the MIP scale - or the MIP hit multiplicity - 
is maintained and under control, all derived scales are 
stabilised as well. 

Test beam experience has demonstrated that the MIP 
scale of the silicon-based ECAL is intrinsically stable over 
long time periods to the per-cent level, where the vari¬ 
ations are mainly due to different experimental condi¬ 
tions (e.g. cable length) at the beam test sites. The 
calibration constants showed no influence from external 
factors like temperature. The correlation of calibration 
constants obtained for different periods of data taking 
in 2006 (Repond et al 2008) are shown in Fig. 59 It 


was shown (Li 2009 Rouene 2014) that the correlation 


coefficient is 83.8% between the calibration constants ob¬ 
tained at FNAL in 2008 and at CERN in 2006. Consid¬ 
ering that many operations like mounting, un-mounting, 
and shipping occurred between 2006 and 2008, this high 
correlation coefficient demonstrates the stability with 
time of the SiECAL prototype. The same level of cor¬ 
relation exists between calibration constants derived for 
the beam tests in 2008 and 2011 at FNAL. The absolute 
calibration of the ECAL can be verified and adjusted by 
comparison with the tracker or using electrons and pho¬ 
tons kinematically constrained such as in Bhabha events 
or e+e“ decays. 

Variations of the MIP scale of scintillation detectors 
are mainly due to changes of the electronic response 
of the photo-sensor, induced by changed thermal condi¬ 
tions, which have successfully been corrected using the 


known temperature dependence; see Section III The 


photo-sensor gain stability can also be monitored by mea¬ 
suring the spacing between peaks in the pulse-height 
spectrum attributed to small, discrete numbers of regis¬ 
tered photo-electrons, which does not require LED light 
stability. In principle it is also possible to adjust the volt¬ 
age in order to compensate the temperature variation, 
and use the gain to watch the stability. The hit multiplic¬ 
ity of gaseous detectors mainly varies as a consequence of 
variations of temperature and pressure which are contin¬ 
uously monitored. The use of radioactive sources is not 
necessary according to present understanding. Changes 
in the amplification of the read-out chain were checked 
independently and found to be much smaller than those 
of the sensors. The pedestals of the read-out electronics 
are regularly monitored using random trigger events; this 
also detects and monitors dead or noisy channels. 

Due to the underground location of collider detectors, 
the orientation of the detector layers, the power pulsing, 
and due to the high granularity, cosmic rays might not 
be sufficient for monitoring the MIP scale in situ. How¬ 
ever, thanks to the excellent imaging capabilities of the 
calorimeters, MIP-like track segments can be identified in 
hadronic showers and used for calibration purposes, see 
Section [VLD[ The potential for in-situ calibration of the 
AHCAL in the ILD detector was studied in simulations 
- for details see (CALICE 2009a). Although typically 
two tracks are found in each shower which are used for 
the calibration of 20 cells, it is even at the Z resonance 
not possible to obtain a channel-by-channel calibration 
within realistic running times. However, the method is 
well suited for the determination of average corrections 
for a sub-section of the detector, e.g. a layer in a module. 


D. Required Accuracy 

A common feature of all particle flow calorimeter 
technologies is their relative insensitivity to any sort of 
stochastic calibration or alignment uncertainty for indi¬ 
vidual cells. The large number of cells required for the 
topological resolution is an asset rather than a burden, 
since the precision with which cell level information needs 
to be known scales with \/]V, where N is the number of 
channels contributing to a shower. Even in the case of the 
scintillator HCAL, which is the coarsest of the detectors 
considered, this amounts to about 10 cells per GeV. 

In contrast, coherent systematic effects must be cor¬ 
rected with higher precision, depending on the fraction 
of the detector affected. If it is the entire calorimeter, the 
required precision is given by the constant term aimed at, 
about 1% for the ECAL and 2-3% for the HCAL. The 
challenge of the high granularity is that time-dependent 
corrections cannot be applied at cell level, and cell-wise 
corrections require stability over time to reach statistical 
precision. On the other hand, since every cell is individ¬ 
ually read out, one is free to form averages over space or 
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time according to the specific problem, but finding the 
optimal averaging procedure and identifying the leading 
effects is often an involved analysis and intimately re¬ 
lated to understanding the detector and its systematics. 
The procedures needed in practice can only be developed 
from real data. Such studies form an important part of 
the test beam data analysis, and they are also the reason 
why each generation of prototypes must undergo beam 
tests at system level again to obtain realistic performance 
figures. 

Using fully detailed simulations of the ILD detector 
and reconstruction based on the Pandora particle flow 
algorithm, different scenarios of statistically independent 
as well as coherent mis-calibration effects have been mod¬ 
elled, affecting the entire AHCAL or parts (module lay¬ 
ers) of it. Purely statistical variations, like those arising 
from calibration errors or random ageing effects, hardly 
affect the energy resolution at all. However, they may de¬ 
grade the in-situ MIP calibration capability. From this, 
a moderate requirement of the inter-calibration stability 
to be ensured by hardware design of ±10% is derived. 

Coherent effects which could for example arise from 
uncorrected temperature variation induced changes of 
the response are potentially more harmful, if they af¬ 
fect the entire detector. However, these are easy to de¬ 
tect, and even a 5% variation only mildly propagates into 
the jet energy resolution. Systematic effects shifting sub¬ 
sections like layers are unnoticeable unless they exceed 
about 15%, comfortably in range of the in situ calibra¬ 
tion method accuracies. 


The validity of these simulation-based estimates has 
been verified by treating the AHCAL test beam experi¬ 
ment at CERN and FNAL like a collider detector, using 
cell-by-cell inter-calibrations only from data taking at a 
different site, under different conditions and after hav¬ 
ing it exposed to disassembly, transport and re-assembly 
influences (Abe et al 2010b CALICE 2009a). Apply¬ 
ing only in-situ monitoring techniques, the scale was re¬ 
established and the resolution reproduced, see Eig. 
Imperfections absent in any simulation showed up, but 
were successfully compensated using a combination of 
techniques. 


Overall, the high granularity and channel count pro¬ 
vides net advantages for calibration. On one hand, due 
to the law-of-large-numbers suppression of statistical ef¬ 
fects, the requirements on individual cell precision are 
very relaxed. Coherent effects, on the other hand, can 
be studied with any desired combination of channels, be 
it layers, longitudinal sections, electronics units or ac¬ 
cording to any other supposed hypothesis of systematic 
effects. The high degree of redundancy and the full in¬ 
formation for each channel provide maximum flexibility, 
without having to rely on intrinsic homogeneity as in the 
case of internal optical or analog summing. 


E 0.2l ' I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I U 

cc _ 


^^ 0.15 

E 


E 0.1 

LU 

J0.05 

r\ 

t • 

: I 1 ! • 

_ • • • 

I • I 

-. .i..» .- 

0 

_ • 


is* : 

-0.05 

_ • _ 

- § - 



-0.1 

- - 


- • A,3f(Tcorr) - 

-0.15 

^ • A(T+Ucorr) ^ 

-0.2 

I From (CALICE,2009a) • A( G corr) - 

, 1 , , , , 1 , , , , 1 , , , , 1 , , , , 1 , , , , 1 , , , , 1 , , , , 1 

10 20 30 40 50 60 70 80 


beam energy [GeV] 


FIG. 60 Residuals from linearity of reconstructed hadron 
showers in the AHCAL, in the range 8 to 80 GeV: using cali¬ 
bration samples from muon runs at FNAL transported to the 
“collider” run conditions and the CERN calibration as refer¬ 
ence. Corrections were applied based on either temperature 
(T) and voltage (U), or on observed photo-sensor gain (G), 
and addition ally on the in-situ MIP stub correction layer-by- 
layer. From ( CALICE} 2009a). 


VI. TESTS OF GEANT4 SHOWER SIMULATION 
MODELS 

A. Shower simulation models 


The simulation of hadron shower evolution is chal¬ 
lenging because there is no single model that describes 
hadronic interactions over the full range of energies that 
play a role in each single shower, from the first hard in¬ 
teraction, possibly several TeV, down to the MeV range 
of nuclear binding effects. Therefore different approaches 
must be combined, and transition regions or even gaps, 
where none is strictly applicable, must be bridged in or¬ 
der to describe the energy deposition in a calorimeter 
in its entirety. The first simulation codes, e.g. in the 
GEISHA framework (Eesefeldt 1985| ), were based on phe¬ 
nomenological parameterisations (LHEP) of the interac¬ 
tion and aimed at reproducing average shower properties 
in calorimeters. However, energy conservation was not 
ensured event by event, so correlations within the shower 
were not described. 

In the past decade, significant progress has been made 
in replacing these empirical parameterisations by more 
fundamental, theory-driven models within the Geant4 
framework ( [Agostinelh et al\ |2QQ3 Allison et al ., 2006). 
The main models are high energy quark gluon string 
based models (QGS, FTP) for hard interactions in the 
range above 5-10 GeV, the Bertini (BERT) or binary 
(BIG) models for the intra/nuclear cascade in the inter¬ 
mediate region, and for processes below about 200 MeV 
de-excitation models including a pre-compound (P) nu- 
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cleus stage, and nucleon evaporation. These are com¬ 
bined into so-called physics lists, with the energy ranges 
depicted in Fig. 

In the transition regions models are selected randomly. 
The models are not tuned to calorimetric measurements, 
so-called ’’thick target” data, which give little hint on 
which detail needs adjustment, but rely on ’’thin tar¬ 
get” data, differential cross sections for the scattering of 
hadrons off nuclei, while thick target data represent the 
benchmark for validation. The development was strongly 
motivated by the needs of the LHC experiments and more 
recently also by the demands of particle ffow calorimetry. 
The focus was originally on describing mainly detector 
response, and shower extension, albeit with the coarse 
spatial resolution of LHC detectors. It is now moving to¬ 
wards internal structure, particle composition and even 
time evolution. 


At the time when the CALICE test beam programme 
was proposed, there were significant uncertainties in the 
simulation of shower shapes, and the predictions for the 
radius, for example, varied between models by up to 


a factor of two (Mavromanolakis and Ward 2004). In 


the meantime, following detailed comparisons with LHC 
test-beam data, major revisions of all stages of the sim¬ 


ulation code have been undertaken (Apostolakis et al. 


2010 Dotti et al. 2011) For example, the range of the 


string-based model has been extended towards lower en¬ 
ergy, and in particular the diffraction based Fritiof model 
does not need any recursion to the old parameterised 
models to bridge the gap to the low energy cascade mod¬ 
els. Also an experimental list was implemented, based on 
the chiral invariant phase space (CHIPS) model, which 
covers the full energy range, but is not finally tuned 
yet. With these improvements, a good description of 
the LHC test-beam data is achieved using the QGSP- 
BERT physics list, which is the most commonly used at 
the LHC. The detector response agrees within a few per¬ 


cent (Abat et al. 2010 Adragna et al. 2009). However, 


some issues remain, namely the radial shower extension 
is underestimated by about 15% (Adragna et al. 2010). 


A good description of global shower properties, except 
possibly for the radius, can thus be expected for the 
CALICE data, too. However, particle ffow reconstruc¬ 
tion makes use of the shower structure in much more de¬ 
tail, for example at the re-clustering or fragment removal 


stage. Even though the algorithms were shown (Thom¬ 


son 


2009) to be rather robust against the choices of 


physics lists, a test of the simulation at the level of de¬ 
tail which the fine segmentation of the CALICE proto¬ 
types allow, forms an essential basis of the validation of 
particle ffow calorimetry and supports the simulation of 
multi-particle states in complex detector configurations. 
As will be shown, new observables like the charged track 
multiplicity are becoming accessible which are intimately 
related to the internal shower structure. 


B. Hadrons in the silicon tungsten ECAL 


Applying a simple power law and given the depth of 
about one interaction length, about 60% of the hadrons 
contained in a jet will interact in the volume of the SiW 
ECAL. It is therefore important to understand the in¬ 
teractions of hadrons in the SiW ECAL. This has been 


addressed in Refs. (Adloff et al. 2010b), for energies 


between 8 and 80GeV and more recently (Bilki et al. 


2015b) for energies between 2 and 10 GeV. The latter test 


a region where many Geant4 physics lists feature a tran¬ 
sition between models for hadronic cascades, see Eig. 61 
The earlier results report on comparisons between data 
and hadronic shower models as implemented in Geant4 
version 9.3. 

Exploiting the longitudinal granularity of the calorime¬ 
ter, the interaction point of interactions of primary 
hadrons can be identified by comparing energy deposi¬ 
tions and hit densities in subsequent detector layers. The 
correlation between the reconstructed and the true inter¬ 
action point, as available from Monte Carlo simulation, 
is shown in the top part of Eig. for pions of an en¬ 
ergy of 20 GeV. A good correlation is seen. Interaction 
layers as found in data and in Monte Carlo simulation 
are compared in the bottom part of Eig. Here, the 
QGSP-BERT physics list is used. The good agreement 
found in this case holds also when testing other Geant4 
physics lists. The efficiency to find the interaction point 
correctly within two layers is around 80% for energies 
above 10 GeV and still as high as 60% for energies at 
2 GeV. At high energies the interaction point can be de¬ 
termined from an absolute energy increase after the in¬ 
teraction of the primary pion with the absorber material. 
Due to shower fiuctuations this absolute increase is less 
prominent at small energies. However, the high longi¬ 
tudinal granularity allows for measuring in addition the 
relative increase of the deposited energy and for deter¬ 
mining the interaction point by that means. Examples 
of the two scenarios are shown in Eig. 

The amount of overlap of showers generated by close 
by particles is governed by the transverse shower radius 
of the particles. In turn it is easy to understand that 
the amount of overlap infiuences directly the precision 
which can be achieved by PEA algorithms. It is thus 
of major importance that the transverse properties of a 
hadronic shower are well modelled by Monte Carlo sim¬ 
ulations. Eig. [64] shows the comparison of the transverse 
shower profile for 8 GeV and 30 GeV pions incident on 
the calorimeter surface. Again, the default physics list is 
chosen to be QGSP_BERT. 

The mean values of the introduced transverse profiles 
for several physics lists and energies are given in Eig. [6^ 
Erom here and from the earlier result it can be con¬ 
cluded that those models which implement the Bertini 


cascade (Guthrie et al. 1968) give a good description at 


low energies. At high energies however, all models pre- 
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FIG. 61 Energy ranges of shower simulation models used in Geant4 physics lists. 




FIG. 62 Top: Gomparison between the interaction layer 
reconstructed in the SiW EGAL and the interaction layer 
as extracted from the GEANT4 event record. This exam¬ 
ple corresponds to a 20 GeV 7r“ beam simulation, using the 
QGSP_BERT physics list. Bottom: Distribution of the re¬ 
constructed interaction layer in the EGAL for 30 GeV data 
(points), compared with Monte Garlo predictions usi ng the 
QGSP-BERT physics list (solid histogram). From (Adloff 
et a/.[[^10b ). 



z direction (layer number) 



FIG. 63 Top: Inelastic reaction of a primary hadron in the 
SiW EGAL with a sizeable release of energy and secondaries. 
Bottom: Inelast ic reaction leading o nly to a local sparse de¬ 
position. From (Adloff et al. 2010b). 


diet smaller shower radii than observed. 

The longitudinal profile is composed of contributions 
from several components. Fig. [66| shows the longitudinal 
profile as a function of the shower depth after the interac¬ 
tion point for pious with an energy of 12 GeV compared 
with the predictions obtained by the QGSP_BERT and 
FTFP_BERT physics lists. 

The profile of the Monte Carlo prediction is broken 
down into its various components. It is clearly visible 


that there are significant differences between the models 
and the data and between the models themselves. This 
is particularly true for short range components generated 
by heavily ionising particles such as protons. This first 
separation into various domains of the shower is contin¬ 
ued in the study presented in Fig. Here the energy 
depositions in different layers after the interaction point 
are shown in greater detail. Again, the early stages of 
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FIG. 64 Radial distribution of hits (energy weighted) for 
SiW ECAL data at energies of 8 and 30 GeV (points with er¬ 
rors) compared with Monte Garlo (solid histograms) using the 
QGSP-BERT p hysics list. The dist ributions are normalised 
to unity. Erom (Adloff et al. 2010b| . 



EIG. 65 Mean energy-weighted shower radius in the SiW 
EGAL as a function of beam energy. The data are compared 
with the predictions of simulations using different Geant4 
physics lists. Erom (Adloff et al. 2010b). 


the shower created by nuclear breakup are not well de¬ 
scribed by either of the models with discrepancies be¬ 
tween 10% and 20%. The list FTFP_BERT overshoots 
the data while both lists labeled as QGSP overshoot the 
data at smaller energies while they undershoot the data 
at higher energies. For bigger shower depth FTFP_BERT 



FIG. 66 SiW EGAL longitudinal energy profiles for 12 GeV 
7 r“ data (shown as points), compared with simulations using 
two physics lists. The mean energy in MIPs is plotted against 
the depth after the initial interaction, in units of effective 
1.4 mm tungsten layers. The total depth shown corresponds 
to ~ 20 Xq or 0.8 Aint.- The breakdown of the Monte Garlo 
into the energy depo sited by different pa rticle categories is 
also indicated. Erom (Adloff et al. 2010b). 


agrees with the data within 5% and is around twice as 
good as the QGSP based models. 


C. Hadron shower shapes in the scintillator AHCAL with 
steel and tungsten 

A large number of studies have been performed in or¬ 
der to test the simulation of showers in the scintillator 
AHCAL, in as much detail as possible, using data taken 
with steel and tungsten absorbers, with pious, protons 
and kaons. Both classical and novel observables were 
studied, and it is impossible to give here a full account 
of how well each model describes every given quantity 
or distribution. The purpose of the present discussion is 
rather to give an overview of which features have been 
the subject of validation studies, and to provide an over¬ 
all picture of the level of adequacy of the simulations. 

Events have been selected to provide pure samples of 
hadronic showers, starting in the AHCAL, i.e. leaving 
only minimum ionising traces in the ECAL, if that was 
installed upstream. Electron, muon and anti-proton con- 
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FIG. 67 Ratio of simulation to data for three different regions 
of the SiW ECAL longitudinal energy prohle for pions: layers 
1-3, dominated by nuclear breakup (top); layers 5-20, domi¬ 
nated by electromagnetic showers (centre); and layers 30-50, 
dominated by penetrating hadrons (bottom). From (Adloff 


et al. 2010b). 


taminations to the selected negative pion sample can be 
neglected. 

The detector simulation had been validated with elec¬ 
tron data as described in Section imi For the simula¬ 
tion of hadron showers, additional effects become impor¬ 
tant. Low energy neutrons can travel for long times, up 
to many microseconds, before they interact and possibly 
produce a signal in the detector. It depends on the read¬ 
out technology and timing characteristics of its electron¬ 
ics, whether this actually contributes to the measured 
energy. In the case of the AHCAL, a cut of 150 ns was ap¬ 


plied according to the shaping time of the pre-amplifiers. 
In addition, low energy protons, as produced by elastic 
scattering of slow neutrons in hydrogenous material, can 
in principle have very high local ionisation energy loss. 
However, in scintillator materials shielding effects lead to 
a saturation of light production. This is empirically de¬ 
scribed by Birks’ law (Birks 1964), which is implemented 
in the Geant4 simulation. Both effects have been found 
to be important for the proper description of the hadronic 
response of the AHCAL. If not taken into account, the 
signal contribution from neutrons is overestimated, which 
results in too wide radial profiles and too large delayed 
signals. For silicon or gas, which are much less sensitive 
to neutrons, this is less critical. 

For the analysis of longitudinal profiles in the steel AH¬ 
CAL (Adloff et al. 2013b), the capability to reconstruct 
the shower start from the three-dimensional hit distri¬ 
bution has been used to de-convolute the distribution of 
average energy per layer into a spectrum of depth of the 
first hard interaction, and a profile measured from this 
first interaction point. This provides a cross-check of the 
material composition and nuclear absorption properties 
in the simulation, and a profile which is more directly 
sensitive to the physics processes at different stages of 
the shower evolution, as already seen in the silicon tung¬ 
sten ECAL discussed above. An example is shown in 
Fig. HU 

From a fit of an exponential function to the distribu¬ 
tion of the starting points, the effective pion interaction 
length has been extracted. As expected, there is no visi¬ 
ble dependence on energy in the range from 8 to 80 GeV. 
The average is = (26.8 ± 0.46) cm, where the error 
includes statistical effects as well as systematic variations 
due to the choice of the fit range. The result is in good 
agreement with expectations based on the material com¬ 
position of the detector as used in the simulation, which 
gives 28 cm, and also provides a cross-check of the start¬ 
ing point reconstruction algorithm. It is also well repro¬ 
duced by recent physics lists, within 4-6%, but not by 
the outdated LHEP parameterisation. 

The comparison of the shower profiles in Fig. [68| shows 
that the profile measured from the shower start is not 
only more compact, but also smoother, since layer-to- 
layer variations in the detector response are averaged out. 
These variations are due to different numbers of channels 
with bad connections or failed calibrations, which are not 
corrected for, but included in the simulation. The com¬ 
parison with simulation models is made with the distri¬ 
butions from the shower start, and systematic uncertain¬ 
ties due to such detector effects are obtained for each bin 
by comparing the distributions for different, but fixed, 
starting points in the detector. 

The profiles for three different energies are shown in 
Fig. together with simulations and their decomposi¬ 
tion. The method of decomposing the simulated shower 
energy depositions is described in (Kaplan 2011). The 























41 



Shower starting point z [mm] 



FIG. 68 Top: Distribution of the measured layer of the 
first hard interaction in data and simulation, both normalised 
to the same arbitrary number of events. Bottom: Longi¬ 
tudinal profiles as a function of distance from the AHCAL 
front face and from the reconstructed shower start point, 
both normalised to unity. The figures are for 45 GeV pions. 
From (lAdloff et a/.| |2013b|). 


electromagnetic component dominates the shower maxi¬ 
mum, while hadronic and electromagnetic depositions are 
comparable in magnitude in the start and in the tail of 
the shower. At 8 and 18 GeV, the profiles are described 
within a few per-cent, whereas at 80 GeV, the simulated 
showers have a somewhat more pronounced maximum. 
This is very similar for the QGSP and QBBC physics 
lists, too. 

The position of the maximum, and the centre-of- 
gravity (energy-weighted mean longitudinal position of 
the shower), are well reproduced for all physics lists, 
as shown for one example in Fig. The energy de¬ 
pendence of the energy-weighted mean shower radius is 
also shown, together with the FTFP-BERT prediction. 
The radial shower profile is less well reproduced: here 
the CALICE results confirm the findings of LHC test- 
beam experiments. All physics lists overestimate the 
core, which is dominated by electromagnetic depositions, 
and under-estimate the tail of the approximately expo¬ 


nentially falling radial distribution. The disagreement in¬ 
creases with energy and exceeds 10% at the upper edge 
of the probed range. While this leaves room for improve¬ 
ment in the shower modelling, we like to recall that the 
systematic studies in (Thomson 2009) have shown that 
the particle flow performance is rather insensitive to the 
choice of the physics list from a rather broad set. 

The studies made with tungsten as AHCAL absorber 
are essential for the validation of the CLIC detector per¬ 
formance, but they are also relevant for the ILC calorime¬ 
ters, since all electromagnetic sections are made from 
tungsten and will influence the development of the ma¬ 
jority of hadronic showers, even if the HCAL itself is 
made from steel. However, the ECAL prototypes them¬ 
selves are not large enough to measure the spatial ex¬ 
tent of hadron showers. The interactions on the heavy 
and neutron-rich absorber nuclei produce a significantly 
larger number of neutrons than in steel, such that the 
shower composition is different, and the validation of 
simulations for steel cannot be straightforwardly extrap¬ 
olated to tungsten, in particular not in the case of hy¬ 
drogenous scintillator material sensitive to soft neutrons. 

The tungsten data, recorded below 10 GeV (Adloff 


et al. 2014a) and between 10 and 100 GeV (CALICE 


2013c), present a picture which is very similar to the 


steel data, in terms of model comparisons. The simu¬ 
lations give somewhat too pronounced longitudinal and 
radial shower maxima for pions at high energies, but oth¬ 
erwise there is very good agreement, including the overall 
response and linearity; see also Section [Hi] Two example 
profiles are shown in Eig. The top part can be com¬ 
pared with the 18 GeV data in Eig.[^. The longitudinal 
shower shape in tungsten is different from that in steel, 
even if displayed as a function of depth measured in units 
of A/, because the ratio of interaction length to radiation 
length Vo is very different, about 25 for tungsten ver¬ 
sus 10 for steel. Therefore the electromagnetic part of 
the shower is more compact than the hadronic, resulting 
in a more pronounced shower maximum and an overall 
somewhat shorter shower. It is interesting that on the 
other hand the mean shower radius, measured in mm, 
is very similar for both materials, which reflects the fact 
that the transverse shower scale is not governed by the 
nuclear cross sections but by other effects, which are less 
dependent on the mass of the nucleus, like the angular 
distribution of the scattering products. 

Eor a proper modelling of particle flow performance 
it is important that not only the mean longitudinal and 
radial extent of the showers are reasonably well repro¬ 
duced, but also the event-to-event (or shower-to-shower) 
fluctuations. As an example, Eig. [72] shows the distribu¬ 
tion of energy-weighted mean z position and radius. As 
already stated, the mean radius is underestimated, but 
the widths of both distributions, characterising the typ¬ 
ical event-to-event variation of these quantities, are well 
reproduced. 
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FIG. 69 Mean longitudinal shower profiles from shower starting point for 8GeV (left column), 18 GeV (center column) and 
80GeV (right column) pions. First row: For data (circles) and for the FTFP_BERT physics list (histogram). Second row: 
Ratio between Monte Garlo and data for several physics lists. All profiles are normalized to unity. The grey area indicates the 
systematic uncertainty on data. (Erec)/AAi is the average deposi ted energy in a AAi thick transverse section of the calorimeter, 
z is the longitudinal coordinate, expressed in units of Ai. From (Adloff et al. 2013b). 


In an ideal particle flow detector, the hadron calorime¬ 
ter would measure the energies of only neutrons and 
mesons. Although in each shower many types of hadrons 
are present, they inherit some properties from the inci¬ 
dent particle. For example, neutron - or proton - induced 
showers are expected to have a smaller electromagnetic 
component, when compared to pions, since baryon num¬ 
ber conversation reduces the phase space for tt^ produc¬ 
tion, and strangeness conservation plays a role in kaon 
induced showers. Moreover, in the case of protons only 
the kinetic energy is available for producing calorimeter 
signals, and for the same beam momentum this is smaller 
than the corresponding pion energy. Therefore, as neu¬ 
tral beams are not available, test beam data with tagged 
protons and charged kaons are of particular interest. 


For the study of proton and kaon showers, single or 
differential Cerenkov counter information was used. For 
tungsten data, the purity was better than 99% for pro¬ 
ton data at beam energies below 10 GeV, and better 
than 80% for protons and kaons at higher energies. For 
the case of steel absorber (Bilki et al 2015a), the pro¬ 
ton purity ranged from 64 to 95% and the pion admix¬ 
ture is corrected for. The agreement of simulations with 
data for shower shapes of protons and kaons is equal or 
better than for pions. The ratio of the response to pro¬ 


tons relative to that for pions, p/n = EpjE^ is shown 
in Fig. [73| together with data from two other scintillator 
steel hadron calorimeters, the CDF end plug calorime¬ 


ter (Liu 1997) and the ATLAS Tile HCAL (Adragna 


et al 


2010). It exhibits the expected slight decrease at 


low energies for steel, attributed to the smaller available 
energy, reduced electromagnetic content and the non¬ 
compensating nature of the calorimeter. A similar be¬ 
haviour was observed in beam tests with a CMS barrel 
calorimeter prototype (Abdullin et al 2009). In the case 
of tungsten (CALICE |2013c ) with its nearly compensat¬ 
ing behaviour, this is less pronounced, see Fig. For 
kaons, no differences were expected within experimental 
precision, and were not seen in data. The resolutions are 
very similar for all hadron types. 


D. Charged track segments in hadronic showers 

Highly granular calorimeters offer an excellent oppor¬ 
tunity to study the internal structure of a hadronic 
shower. Hadronic showers are not amorphous clouds 
of spatially varying energy density, but they exhibit a 
rich sub-structure. The cascade evolution leads to a 
tree-like structure, with partially visible and invisible 
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FIG. 70 Energy-weighted mean longitudinal position (top) 
and radius (bottom) of pion showers as a function of 


beam momentum, for dat a and FTFP-BERT simulations. 
From (lAdloff et a/.l|2013b). 


branches. There are centres of dense activity - due to 
electromagnetic sub-showers, with size characterised by 
Xq, and short-ranged nuclear evaporation products - lo¬ 
cated around points of hard interactions, and regions 
with sparse signals in-between, in which leading inter¬ 
action products travel distances of the order of a nuclear 
interaction length, leaving only a MIP-like signature, or 
none at all. Particle flow algorithms make use of this 
sub-structure, for example by using the pointing infor¬ 
mation of tracks to or from clusters of energy deposi¬ 
tion. It is thus highly relevant to ask whether this is ade¬ 
quately modelled in the simulations, and highly granular 
calorimeters are needed to study this. 

For the study using the AHCAL (Adloff et al. 2013a), 
tracks are searched for using a nearest-neighbour type 
algorithm on isolated hits, and Hough transformations 
in a subsequent filter stage in order to reject unphysical 
tracks with kinks or jumps. It was shown that the num¬ 
ber of tracks found with this technique is indeed corre¬ 
lated with the number of hard charged tracks produced 
in the simulated interactions. The distribution of the 
track multiplicity for 25 GeV pion showers is shown in 
Fig. [74] and compared to different physics lists. The bot¬ 
tom figure displays the evolution of the mean multiplicity 
with shower energy. Here, the primary track of the in¬ 
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FIG. 71 Energy-weighted longitudinal (top) and radial (bot¬ 
tom) pion shower profiles f or W-AHGAL data and differen t 
Geant4 physics lists. From ( GALIG^|2013c| Sicking] 2013] ). 


coming hadron has been excluded, and only secondary 
tracks starting in the third layer or later are shown. The 
multiplicity is well reproduced by recent physics lists, in 
particular QGSP-BERT, while the older parameterisa- 
tions (LHEP, also used in QGSP-BIC) fail to reproduce 
the showers at this level of detail. A similar conclusion is 
drawn from the distribution of track lengths. Although 
there are still imperfections visible, this agreement is re¬ 
markable and a triumph for the modern, theory-driven 
shower models. 


A similar study has been made, using the RPC- 
SDHCAL, in which the secondary tracks of the in- 
ternuclear cascade of the hadronic showers are recon¬ 
structed (CALICE 2013f) After separating these charged 
hadrons from the electromagnetic core or more generally 
from the interaction region, tracks are reconstructed by 
means of the Hough Transformation. The mean number 
of reconstructed tracks is shown in Fig. as a function 
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FIG. 72 Distribution of energy-weighted mean longitudinal 
position (top) and radius (bottom) of pion showers f or W 
AHCAL data and different Geant4 physics lists. From (GAL- 
ICE| |2013c| |Sicking| . 


of energy. 

As expected it increases with increasing energies. The 
logarithmic increase reflects the fact that at higher ener¬ 
gies the electromagnetic fraction of the hadronic shower 
increases which disfavours the production of measurable 
tracks. The number of tracks is adequately reproduced 
by the physics lists of type QGSP and FTFP where the 
data favour slightly the QGSP approach. 

The depth of the prototype allows for measuring the 
length of tracks in the hadronic shower. It is expected 
that a majority of particles travel a distance equivalent 


to about one pion interaction length AJ. Fig. 76 demon¬ 
strates that this is indeed the case. The distribution of 
the track lengths is adequately described by all tested 
Monte Carlo models over a broad energy range. This 
confirms that the basic pion-nucleon cross sections are 
correctly implemented in the Geant4 physics lists. 


E. Shower evolution with time 

The timing capabilities of calorimeters do not only de¬ 
pend on the signal generation in the active part, and 
subsequent processing in the read-out electronics, but 
hadronic showers also exhibit an intrinsic time structure. 



From (CALICE, 2013c) 
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FIG. 73 Ratio of AHGAL response to protons relative to 
pious, as a function of beam mome ntum, for steel (to p) and 
tungsten (bottom) absorber. Fro m (Bilki et al. 2015a| (top), 
( GALIG^|2013c[[Sicking 2013| ) (bottom). 


to delayed nuclear de-excitation processes and the slow 
propagation of low energy neutrons. In experiments at 
colliders with high interaction or background rates, such 


as CLIC, this leads to pile-up. It was shown (Linssen 


et al 2012) in simulations that even under such harsh 


conditions particle flow methods can be successfully ap¬ 
plied, by combining calorimeter topological and timing 
information, and exploiting the high granularity of both. 
Moreover, it has been suggested to apply timing cuts in 
order to suppress neutron-induced hits, which are less 
well correlated in space with the shower axis, by means 
of timing cuts and thus ease the particle flow reconstruc¬ 


tion (Abe et al. 2006). A full validation of particle flow 


must thus include a test whether the time evolution of 
hadronic showers is adequately modelled. 

For this purpose, dedicated add-on tungsten timing 


as observed e.g. in (Caldwell et al. 1993), which is due 


test-beam (T3B) (Simon et al. 2013) instrumentation 
has been constructed and operated together with the W- 
AHCAL, W-DHCAL and Fe-SDHCAL in the SPS beam 
at CERN. Its active part consisted of 15 read-out ele¬ 
ments with 3x3 cm^ active area each, either scintillator 
tiles with MPPCs, or an RPC with 15 pads. They were 
installed behind the last layer of the absorber structures 
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FIG. 74 Track multiplicity distribution for 25 GeV pion 
showers (top), and mean secondary track multiplicity in the 
Fe-AHGAL as a function of beam energy (bottom). The lower 
panels show the residuals of the comparisons with different 
physics lists. Both from ( Adloff et a/.[ |2013a] ). 


(/) 7^ 

CO - 

3 : 

^ 5- 


o 

o 

CD 






nr 


FTFP_BERT_HP 

QGSP_BERT_HP 

LHEP 

SDHCAL DATA 


1 - 


From (CALICE, 2013f) 


iliiii h 




iliiii h 


(a) 



FIG. 75 Mean number of reconstructed tracks in hadronic 
showers in the SDHGAL as a function of the beam en¬ 
ergy (a) and the ratio between simulation and data (b). 
Black crosses correspond to data. Red squares corre¬ 
spond to FTFP_BERT_HP physics list, blue triangles to 
QGSP_BERT_HP and green triangles to LHEP. The digitizer 
used in the simulation wa s tuned using hadron data as ex - 
plained in the text. Erom (GALIGE 2013f Laktineh 2013). 


and aligned radially from the beam axis to the outer part 
of the HCAL. In both cases the read-out was performed 
by custom pre-amplifiers and fast digitisation electron¬ 
ics with 1.25 GHz sampling frequency and large buffer 
depth. Pulse shape analysis allowed the reconstruction 
of the time of the first hit with nano-second accuracy. 
Due to the fine segmentation the probability of a second 
hit is small, so using the first does not introduce a bias. 
In the case of the AHCAL, the T3B read-out was syn¬ 
chronised with that of the main calorimeter, such that 
using events with the shower start reconstructed in dif¬ 
ferent layers, the timing could be investigated not only 
as a function of shower radius, but also of shower depth. 

Fig. shows the time distributions measured with 
scintillator for hadron showers in tungsten and steel ab¬ 
sorber (Adloff et al 2014b). The distribution for muons 


is also shown, to illustrate the time resolution of the set¬ 
up, including effects of trigger, sensor, electronics and re¬ 
construction. In contrast to the prompt muon response 
that for hadrons exhibits sizeable delayed contributions, 
which are significantly more pronounced for tungsten 
than for steel. This was expected from simulation, due to 
the higher abundance of neutrons in showers in tungsten. 
The late hits in tungsten are also more energetic (several 
MIPs) than in steel, where most are close to the thresh¬ 
old of 0.5 MIP. The bottom part of the figure shows a 
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FIG. 76 The track length in the GALIGE SDHGAL for 
40 GeV hadron showers compared with predictions from var¬ 
ious Geant4 physics lists. Erom (|GALICE| |2013f| |Laktineh 
20131. 


comparison of the time spectrum measured with scintil¬ 


lator and with an RPC (CALICE 2013e). In the region 


between 10 and 50 ns, the delayed component is much 
smaller for the gaseous detector in the same absorber, 
which is attributed to the much reduced sensitivity to 
neutrons, since the hydrogen content and density is very 
small compared to that of plastic scintillator. 
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FIG. 77 Distributions of the time of first hit for hadrons 
and muons, measured with scintillator in steel and tungsten 
(top), and comparison between scintillator and gas for tung¬ 
sten (bottom). Fro m (Adloff et al. 2014b| (top) and (CAL¬ 


ICE 


2013e 


Simon 2013| ) (bottom). 


In Fig.j^the timing measurements are compared with 
simulations. They show for tungsten the average time 
of the first hit as a function of radial distance from the 
beam axis. The delayed component becomes more im¬ 
portant at large radius and, as could be shown for tung¬ 
sten, in the rear part of the shower. The QGSP-BERT 
physics list with the designation HP (for high precision), 
and also the QBBC list, include a specialised treatment 
of neutron transport and a detailed implementation of 
energy-dependent absorption cross sections. In the case 
of tungsten, only these lists are able to reproduce the 
data reasonably, while the QGSP-BERT list without this 
treatment grossly overestimates the delayed component. 
In the case of steel, all lists give an acceptable descrip¬ 
tion. 


VII. TESTS OF PARTICLE FLOW ALGORITHMS 

The performance of a particle flow detector depends 
on its overall design parameters, like radius or magnetic 
field strength, and crucially on the calorimeter system. 
The main driving parameters are its single hadron energy 
resolution, at low jet energies, and the charged hadron- 
neutral hadron separation power for more energetic jets 
with higher particle density. More precisely, this means 
the capability of properly assigning energy depositions in 
the calorimeters to either charged hadrons measured in 



EIG. 78 Mean time of first hit as a function of radius, mea¬ 
sured with scintillator for hadrons in tungsten, for data and 


different physics lists. Erom (Adloff et al 2014b). 


the tracking system, or reconstructing them as neutral 
hadron showers. Mis-assignments, generally referred to 
as confusion^ lead to non-gaussian degradations of the 
jet energy resolution, due to double-counting of charged 
shower fragments misinterpreted as neutral particles, or 
losses of neutral hadron energy erroneously assigned to 
a charged track. Photon-hadron confusion is also im¬ 
portant, but simpler to resolve, due to the more com¬ 
pact photon showers and the more effective use of shower 
shape criteria for their identification. This is, for ex¬ 
ample, demonstrated in (Jeans et al. 2012) that uses 
the GARLIC algorithm for photon identification. The 
particle flow algorithm minimises confusion by a set of 
successively applied algorithms, each compensating for 
shortcomings of previous stages and refining the result, 
thereby making use of details of the shower topology. It is 
of high interest to test these algorithms on real data, be¬ 
cause, even with the very good agreement between data 
and simulations presented in the previous sections, it is 
in principle still possible that the reconstruction makes 
use of so far untested features of the shower, or of the 
detector modelling in the simulation. 

In order to illustrate the topological situation to re¬ 
solve, Fig. shows distributions of particle momenta 
and distances for two classes of linear collider events: 
e+e“ Z ^ qq with q = u^d^s and e+e“ ^ WWz/P ^ 
4/ where / denotes quarks or leptons. The first is rep¬ 
resentative for the decay of heavy bosons produced near 
threshold, the second for a more complex event topol¬ 
ogy at higher energy and with more energetic jets. The 
spectra have been obtained (Brianne 2014) using simula¬ 
tions of the ILD detector (Behnke et al. 2013) at the ILC, 
and the distances have been measured at the front face 
of the electromagnetic calorimeter, after propagating the 
particles in the 3.5T magnetic field. They show that typ- 
cial single particle energies are around 10 GeV, and that 
distances between neighbouring particles change consid¬ 
erably with centre-of-mass or jet energy. While for the 
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decays of Z bosons at rest particles are well separated and 
showers will rarely overlap, at higher energies the density 
is higher, and distances of a few centimetres need to be 
resolved. These typical situations, which are relevant for 
the contribution of confusion to the jet energy resolution, 
have been studied using test beam events. 




FIG. 79 Top: Momentum distribution for charged particles 
in simulated Z ^ uds and WW^ 4/ events. Bottom: Dis¬ 
tribution of distances to closest charged track for neutrals 
produced in such events, measured at the front face of the 
electromagnetic calorimeter of the ILD detector. 


A. Event overlay method 


The method to apply particle flow algorithms to test 


beam data was first used (Morgunov and Raspereza 


2004) in the framework of optimising the granularity of 
the CALICE AHCAL prototype with simulations. It 
uses event mixing techniques to overlay two test-beam 
events on top of each other and to study the reconstruc¬ 
tion performance as a function of the transverse sepa¬ 
ration between them. This is possible since, due to the 
high granularity, the mean occupancy per event, and the 
fraction of calorimeter cells shared by two showers, is 
small. As it is possible to reconstruct the shower starting 
point, the charged track inside the calorimeter pointing 
to it can be removed from the event in software, such 


that a pseudo-neutral hadron shower is constructed from 
charged hadron beam data. 

For the study with real data (Adloff et al. 2011c), data 
taken with the CALICE silicon tungsten ECAL and scin¬ 
tillator steel AHCAL taken at the CERN SPS have been 
used. Beam energies of 10 and 30 GeV have been cho¬ 
sen as being representative for particles in 100 GeV jets. 
Pure pion samples are selected using Cerenkov counter 
information. In order not to dilute the energy measure¬ 
ment with leakage effects, events with more than 5% 
of the visible energy in the TCMT are rejected. This 
suppresses showers starting late in the HCAL and thus 
reduces the average longitudinal separation of showers, 
which makes the separation harder and the results more 
conservative. 

Pseudo-neutral particles are constructed and laterally 
displaced by a distance between 5 and 30 cm, using the 
track in the upstream wire chamber as reference. The 
showers were mapped onto the ILD detector model with 
the same longitudinal segmentation as the ECAL and 
AHCAL test beam prototype, and 1x1 cm^ and 3x3 cm^ 
transverse cell size in ECAL and HCAL, respectively. 
Pairs consisting of one displaced pseudo-neutral and one 
charged shower were positioned such that the particles 
had normal incidence to a barrel octant. Hit energies of 
the few shared cells were added. The possibility that two 
hits below threshold add up to one hit exceeding it was 
neglected, since in simulations it contributes only 0.1% to 
the shower energy. The Pandora algorithm was slightly 
modified and adapted to this configuration, for example 
photon identification was skipped, and the track helix 
was replaced by a straight line, since there was no mag¬ 
netic field. Simulated test beam events were subjected 
to exactly the same procedure as data. 


B. Shower reconstruction 

The effect of confusion is quantified by studying the de¬ 
terioration of the neutral particle measurement induced 
by the presence of a nearby charged particle shower. This 
is measured by comparing the reconstructed energy of 
the neutral with the energy of the original undisturbed 
calorimetric measurement, before the overlay procedure. 
The neutral energy is the one obtained by Pandora in ad¬ 
dition to that of the precisely known charged track. This 
deterioration is shown in Eig.j^for a neutral of 10 GeV, 
for two different energies of the charged particle, and two 
different distances between charged and neutral. Here 
the electromagnetic energy scale is used. On that scale a 
10 GeV charged shower has a mean reconstructed energy 
of 8.2 GeV, and a neutral of 7.6 GeV, with the differ¬ 
ence due to the energy deposited by the removed primary 
track. The worst deterioration occurs for a high energy 
(30 GeV) shower at close (5 cm) distance (bottom left). 
The left peak corresponds to the situation where most 
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of the neutral energy is wrongly assigned to the charged 
track. This is more likely to happen if the charged energy 
fluctuates downwards, because Pandora makes use of the 
expected shower energy, based on the tracking informa¬ 
tion, in addition to the topological criteria. Still, even at 
so close distance, there is still a large fraction of neutrals 
reconstructed with little deviation from the un-perturbed 
measurement. The case where the charged track showers 
in the ECAL and the neutral in the HCAL is much more 
difficult to resolve than in the reverse situation, where the 
primary track seen in the ECAL helps to disentangle the 
showers. The confusion for close distance (left plots) is 
considerably reduced for smaller charged particle energy, 
and largely disappears at larger distance. 

The simulated distributions are superimposed. The 
more recent QGSP-BERT physics list describes all four 
situations remarkably well. In contrast, the out-dated 
parameterisation-based LHEP list, which also tends to 
predict too narrow showers, underestimates the confusion 
effects. 


C. Study of confusion effects with data 


In order to quantify the effect of confusion as a function 
of particle separation, the root mean square (RMS) dif¬ 
ference of recovered energy versus the original measure¬ 
ment, as plotted in Eig. was calculated for different 
values of the distance between charged and neutral parti¬ 
cles. (Note that this is not the RMS width of the distribu¬ 
tion, but the RMS deviation from zero, which is larger). 
In order not to overemphasise the tails, mostly for large 
distance, the RMSgo - the RMS of the central part con¬ 
taining 90% of the entries - is shown here; see Eig. 81 
The simulation based on QGSP-BERT physics also de¬ 
scribes the dependency on distance very well, while the 
LHEP list generally underestimates the confusion. RMS 
variations of several GeV are non-negligible in relation 
to the aimed-at jet energy performance at a linear col¬ 
lider. However, the impact on the average jet resolution 
is moderate, since the abundance of 30 GeV particles in 
100 GeV jets is small and the probability for very small 
distances is low. 

Another way to quantify the effect is to calculate the 
fraction of neutral particles measured with an energy that 
is consistent with the nominal value within errors given 
by the calorimetric resolution cr, which in this study is 
55% times ^/l^ GeV for data. This fraction, for a ±3cr 
interval, is shown as a function of the distance to the 
nearby shower in Eig. The absolute values of the 
probabilities at all distances are very well described by 
the recent physics list. The picture is qualitatively the 
same if a narrower ±2cr window is chosen. In that case, 
the probabilities approach 55% and 35% at zero distance, 
for overlaid 10 GeV and 30 GeV charged showers, respec¬ 
tively. The confusion effect decreases as the energy of the 


neutral particle increases. Eurther studies with the test 
beam data have shown that the RMS deviation shows lit¬ 
tle dependence on energy, and since the absolute energy 
resolution increases with a/E, the probability to recover 
the shower energy within ±3cr increases as well, i.e. the 
hadron energy resolution becomes the dominating effect. 


The size of the confusion effects measured here are a 
conservative approximation of those in a collider detec¬ 
tor. The absence of a magnetic field, which on average 
separates the rear parts of the showers more from each 
other than the early parts, the requirement of contain¬ 
ment in the HGAL, the larger tiles in the outer parts of 
the prototype and the distortions due to mapping on the 
uniformly fine granularity of the ILD detector all make 
the task of shower separation more difficult. However, in 
this study they affect data and simulation in the same 
way and do not devalue the conclusion on the excellent 
agreement between them for the more recent physics list. 
The simulation studies on the jet energy performance of 
the ILD detector (Thomson 2009) have been performed 
using different physics lists and have shown that the 
jet energy resolution is rather robust against the choice 
of the model. The difference between the physics lists 
demonstrates that the performance of the algorithm and 
the study made here are indeed sensitive to details of 
the shower modelling, therefore the agreement with the 
QGSP-BERT model is a very significant and non-trivial 
result. It also shows that there are no hidden or not sim¬ 
ulated effects in the detector hardware or its calibration, 
which would degrade the performance with respect to the 
simulated one. 


VIII. SUMMARY AND OUTLOOK 


Research in particle flow calorimetry performed in the 
past decade has significantly enhanced our capabilities 
to precisely measure the energy of jets in high energy 
collider experiments. Detailed simulation studies in the 
framework of preparing detector concepts for future elec¬ 
tron positron linear colliders such as ILG and GLIG have 
shown that resolutions around 3.5-4% can be realised for 
jet energies from 50 up to 1500 GeV. Such a resolution 
was shown to provide a separation of W and Z bosons 
decaying into pairs of jets on the basis of the di-jet in¬ 


variant mass, as shown in Eig. 83 This progress has come 
through a parallel development of sophisticated topolog¬ 
ical shower reconstruction algorithms like the Pandora 
package, and of novel detector technologies with which 
the required granularity can be realised in practice. Ex¬ 
amples of such technologies are silicon pad arrays, silicon 
photo-multipliers on small scintillator cells, or advanced 
gas amplification structures with readout segmented in 
two dimensions. 

A broad experimental effort, driven by the need to de- 
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FIG. 80 Difference between the recovered energy and the measured energy for the 10 GeV neutral hadron at 5 cm (left) and 
at 30 cm (right) from the 10 GeV (top) and 30 GeV (bottom) charged hadrons. Data taken with a combination of SiW EGAD 
and Fe-AHG AL (black) are compared to MG predictions for LHEP (red) and QGSP_BERT (green) physics lists. From ( [Adloff 


et al. 2011 c| ). 


velop calorimetry for the ILC, and mostly organised by 
the CALICE collaboration, has resulted in an experimen¬ 
tal foundation for the particle flow approach. It com¬ 
plements the simulation studies with complete detectors 
and complex collider event topologies; together they es¬ 
tablish this novel approach to calorimetry. Large-scale 
prototypes have been constructed, based on the novel 
technologies proposed for the future linear colliders and 
featuring the same fine granularity in three dimensions 
as used in the simulations of the jet energy performance. 
They have been exposed to test beams with muons, elec¬ 
trons and hadrons in major campaigns at CERN and Fer- 
milab.The accumulated data cover a large energy range 
from a few up to more than 100 GeV and allow testing 
the basic performance - stability, linearity and resolu¬ 
tion - of the prototypes, the adequacy of shower model 
based simulations in great detail, and the performance of 
the algorithm in disentangling complex event topologies 
under the conditions of real data. 

The beam tests started with the silicon ECAL and the 
scintillator-SiPM HCAL technology, and consequently 
the analysis of these data is most advanced. However, all 
technologies have been successfully commissioned, have 
recorded large data sets and, in the results presented 


so far, demonstrated their basic performance. No show 
stoppers were found. Therefore the proof of principle 
based on the full analyses holds for the feasibility of 
particle-flow type reconstruction in fine-grained calorime¬ 
ters in general. The data are available and ensure that 
final performance evaluations, and quantitative valida¬ 
tions of detector and shower physics models can be ex¬ 
pected also for the more recently tested technologies in 
the near future. 

The CALICE data confirm the progress made in recent 
years in improving the theory-motivated shower models 
implemented in the Geantd framework, which had been 
driven by test beam studies with LHC calorimeter pro¬ 
totypes, and they provide a basis for further refinements. 
Electron beams have been used to validate the under¬ 
standing of the detector response itself in terms of sim¬ 
ulation, since electromagnetic showers can be precisely 
predicted. Overall, the more recent hadron shower mod¬ 
els reproduce the pion and proton data in the relevant 
energy range with an accuracy of a few percent. Shower 
shapes are well described in terms of average depth, while 
their width remains somewhat under-estimated as ener¬ 
gies increase. Above around 50 GeV the shower core 
is too pronounced in simulations, both in longitudinal 
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Distance between shower axes [mm] 


FIG. 81 RMS 90 deviations of the recovered energy of neu¬ 
tral 10 GeV hadrons from its measured energy vs. the dis¬ 
tance from charged 10 GeV (circles and continuous lines) 
and 30 GeV (triangles and dashed lines) hadrons for beam 
data taken with a combination of SiW EGAL and Fe-AHGAL 
(black) and for Monte Garlo simulated data, for both LHEP 
(red) and QGSP_BERT (green) physics lists. From (Adloff 
et al. ] |2011c| ). 



Distance between shower axes [mm] 


FIG. 82 Probability of neutral 10 GeV hadrons energy recov¬ 
ering within 3 standard deviations from its real energy vs. the 
distance from charged 10 GeV (circles and continuous lines) 
and 30 GeV (triangles and dashed lines) hadrons for beam 
data taken with a combination of SiW EGAL and Fe-AHGAL 
(black) and for Monte Garlo simulated data, for both 
(red) and QGSP_BERT (green) physics lists. From 
et aL\ |2011c| ) 


LHEP 

Adloff 


and radial extension. Despite these remaining imperfec¬ 
tions, the recent models describe the details of the show¬ 
ers remarkably well. The accuracy of the hadronic shower 
modelling has reached a level which qualifies them as a 
tool for quantitative detector optimisation. 

The studies probe the simulations in more detail than 


just average response and shapes, and the high granular¬ 
ity has been used extensively. Shower profiles were mea¬ 
sured from the reconstructed location of the first hard 
interaction and thus provide more sensitivity to the dif¬ 
ferent components relevant for the different phases of the 
evolution in EGAL and HCAL. The fine segmentation is 
an ideal basis for software compensation techniques and 
was quantitatively studied: it improves the resolution by 
up to 20%, and a stochastic term of about 45% for the 
hadronic energy resolution shows that highly granular 
calorimeters can have competitive energy measurement 
performance. The reconstructed starting point also en¬ 
ters into topology-based leakage estimation, a novel tech¬ 
nique to further improve resolutions at higher energies, 
based on the fine spatial information. 

The CALICE results include a first demonstration 
that digital hadron calorimetry, based on hit counting 
only, works in principle, both conceptually and techno¬ 
logically, as demonstrated with prototypes with half a 
million channels. The stochastic contributions to the 
achieved resolution is comparable to that obtained with 
analogue methods. The constant term appears to be af¬ 
fected by saturation effects at higher energies, and it was 
shown that the semi-digital method could mitigate these 
by combining the information from multiple thresholds. 
Quantitatively understanding the high-energy response 
in terms of simulations and disentangling instrumental 
effects from those due to high particle densities is a chal¬ 
lenge which is being actively addressed in the analysis of 
the existing data. These studies will form a basis for the 
optimisation of read-out granularity and number of bits. 

Studies using tungsten as HCAL absorber extend the 
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FIG. 83 Reconstructed invariant mass distribution for the 


simulated in the ILD detector. From (Thomson 2009). 
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reach of the validation to applications at multi-TeV ener¬ 
gies, such as at CLIC, where denser absorbers are consid¬ 
ered. They probe the shower models in a regime where 
neutrons play a more important role than in steel, and 
the adequacy of the simulations is shown to be equally 
good as for the lighter material. Using a minimal but 
well conceived set-up with a few cells equipped with fast 
sampling electronics, a first look into the time evolution 
of hadron showers became possible. The results show 
that this is well modelled but that in tungsten the de¬ 
tailed simulation of neutron transport - so-called high 
performance versions of physics lists - is necessary. They 
support the simulated particle flow performance at CLIC 
energies, where both spatial and timing information are 
critical for pile-up rejection. The tungsten scintillator re¬ 
sults are also relevant for the scintillator ECAL option of 
an ILC detector. 

Particle flow reconstruction makes use of the shower 
topology in greater detail than could be probed with ear¬ 
lier beam tests, for LHC detectors for example. The ob¬ 
served multiplicity of charged tracks visible in the shower 
evolution is quantitatively reproduced by simulations of 
gas and scintillator HCALs. A test of the particle flow 
reconstruction methods was performed by applying the 
Pandora algorithm to test beam events overlaid on each 
other, and quantifying confusion effects by measuring 
the degradation of neutral hadron energy measurement 
through the presence of close-by charged hadron showers, 
and comparing results with real data to those with sim¬ 
ulations. It adds to the realism of the study that it uses 
data taken with the combined set-up of ECAL and HCAL 
and makes use of the fine spatial resolution of both. The 
excellent agreement represents important underlying ev¬ 
idence for the applicability of the method for jet mea¬ 
surements at future colliders. The results obtained so 
far form a cornerstone of the conceptual detector designs 
proposed for the ILC. 

Eurther research will focus on completion of the valida¬ 
tion through test beam data analysis. Important issues 
are the validation of hadronic response simulation in the 
gaseous HCALs and the scintillator ECAL, and the appli¬ 
cation of particle flow algorithms on data taken with al¬ 
ternative combinations of detectors. Data sets are avail¬ 
able to probe the interplay of the the silicon ECAL and 
RPC DHCAL, both with 1 cm^ segmentation, and the 
combination of ECAL and AHCAL both with scintilla¬ 
tor read-out. In parallel, the particle flow reconstruction 
algorithms will be developed further. In particular the 
topological reconstruction in the (semi-) digital HCALs 
will be optimised, using real data. 

The effort to prepare options for detectors at the 
planned international linear collider is now addressing 
the technological challenges of integrating the enormous 
channel densities. Engineering solutions are being de¬ 
veloped that are scalable towards a full collider detector 
whilst maintaining the compactness and minimising the 


dead spaces needed for supports and services. In practice, 
this requires embedding the front end electronics into the 
detector volume and therefore keeping power consump¬ 
tion as low as possible, for example using power cycling 
according to the time structure of the accelerator. Data 
need to be digitised and zero-suppressed in the front end, 
and compressed on the detector. Low material budget so¬ 
lutions need to be developed for power distribution and 
dissipation. Such concepts have been developed for the 
different technologies, and small sets of detector mod¬ 
ules with silicon and scintillator sensors have been suc¬ 
cessfully tested (Behnke et al 2013), but power cycling 
must still be fully established, including operation in in 
high magnetic fields. The SDHCAL test beam prototype 
represents the largest system to test these concepts and 
has been routinely operated with cycled power to reduce 
cooling needs. However, none of the prototypes addresses 
all the challenges yet. Eull scale beam tests, as reported 
here, will also be necessary with scalable technological 
prototypes in order to re-establish performance and cal¬ 
ibration and correction procedures at system level. 

The experience collected through these first steps to¬ 
wards realistic prototypes, together with the established 
confidence into the realism of the simulations in detail 
now provides the basis for a stringent cost-performance 
optimisation. In parallel, the large channel counts also 
require a level of industrialisation which goes beyond that 
of existing large calorimeter systems. This has barely be¬ 
gun, and will influence further developments, too. 

Recently, on the technology frontier, new options have 
appeared which may further enhance the potential of 
fine-grained calorimetry. Examples are monolithic active 
pixel sensors for ultra-flnely segmented electromagnetic 


calorimeters, or digital silicon photomultipliers (Erach 


2012) which may considerably simplify the front end 


readout of scintillator-based calorimeters. These will be 
evaluated and potentially explored in prototypes, simu¬ 
lations and new reconstruction algorithms. 

Particle flow methods are now successfully and rou¬ 
tinely applied at the LHC, with decisive gains in jet 
and missing energy performance. Yet, due to the much 
coarser spatial information in the ECAL and only mini¬ 
mal segmentation in the HCAL, when compared to the 
conceived linear collider detectors, the achieved resolu¬ 
tions are not yet reaching the stringent ILC goals. En¬ 
couraged by the demonstrated performance, and also by 
the successful experiments reported here, fine-grained 
calorimeter solutions are now being considered for the 
upgrades of LHC detectors for the high-luminosity phase. 
This brings additional challenges, both for the recon¬ 
struction software side, due to larger particle densities 
and energies, and pile-up, but also on the hardware side. 
Sensors need to meet higher radiation tolerance levels 
and rate capability needs, and the readout has to cope 
with much larger bandwidth, within tight power budget 
limitations. 
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In all these future directions, algorithms and technolo¬ 
gies need to be developed in parallel, and meet in test 
beam experiments. 
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